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5 . 

MUTANT NUCLEIC BINDING ENZYMES AND USE THEREOF IN 
DIAGNOSTIC, DETECTION AND PURIFICATION METHODS 

10 

RELATED APPLICATIONS 

15 

This application is related to U.S. application Serial 
No. 09/347,878 to Chong-Shen Yuan, filed July 6, 1999, entitled 
"COMPOSITIONS AND METHODS FOR ASSAYING ANALYTES" and U.S. 
application Serial No. 09/457,205 to Chong-Shen Yuan, filed 
20 December 6, 1999, entitled "COMPOSITIONS AND METHODS FOR 
ASSAYING ANALYTES." U.S. application Serial No. 09/457,205 is a 
continuation-in-part application of U.S. Patent Application Serial No. 



wo 01/62968 



PCT/USOl/00452 



09/347,878, fjled July 6, 1999, now pending. The contents of each of 
these applications is incorporated herein in its entirety. 

FIELD OF THE INVENTION 

5 Methods for detecting nucleic acids that contain any abnormal 

base-pairing in a nucleic acid duplex are provided. The methods are 
particularly useful for prognosis and diagnosis of diseases, disorders and 
pathogenic infections and for detection of nucleic acid polymorphisms. 
Also provided are mutant nucleic acid binding enzymes, particularly repair 
10 enzymes, that retain binding specificity and affinity, but lack catalytic 
activity. Combinations, kits and articles of manufactures that contain 
these mutant enzymes are also provided, 

BACKGROUND OF THE INVENTION 

1 5 In the wake of the human genome project, future medical practice 

will use more and more human genetic information for disease prognosis, 
diagnosis and prevention. The need for rapid and accurate methods of 
genetic variation detection are escalating. It is th^se nucleic acid 
mutation detection technologies that will ultimately help to reveal the 
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relation between hunnan genetic nnakeup and diseases. Although 
methods are available for detecting DNA nnutations/polymorphisms, none 
is suitable for use in a high throughput format for detecting large numbers 
of mutations/polymorphisms simultaneously in a single assay format. 
5 This lack of suitability derives from the requisite use of specific 

probes for detecting mutations in the target nucleic acids. For example, 
PCR-restriction fragment length polymorphism (PCR-RFLP) (see, 
e.sr.,Bashiruddin, Methods Mol. Biol., 104 :167-78 (1998); Hyland et al., 
Transfus. Med. Rev., 9141:289-301 (1995); Gasser and Chilton, Acta, 

10 Trop., 59(1):31-40 (1995); and Pourzand and Cerutti, Mutat Res., 
288X11:1 1 3-21 (1 993)), hot only requires the design of target-specific 
probes, but also involves a gel-electrophoresis step to analyze the DNA 
digestion patterns in comparison with the wild type gene. It is a time 
consuming and expensive procedure. Similar problem exists with other 

1 5 methods such as single-strand conformation polymorphism (PCR-SSCP) 
detection, which also requires specific probes and gel-electrophoresis 
(Hayashi and Yandell, Hum. Mutat, 2(51:338-46 (1993); Hayashi, Genet 
Anal. Tech. AppL, 9(3) :73>9 (1992); and Hayashi, PCR Methods AppL, 
1111:34-8 (1991)). Methods, such as the Invader™ assay (Third Wave 
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Technologies, Inc.) for detection of polymorphism based on the use of 
Cleavase enzymes to cleave a complex formed by hybridization of 
overlapping oligonucleotide probes (Marshall et a!., J. Clin. Microbiol., 
35(1 2) :31 56-62 (1997)). eliminates the gel-electrophoresis step, but the 
5 method requires more probes specific for the genes to be tested. 
Moreover, the tnvader^^ assay method works only when the exact 
mutation and mutation position are known. Therefore, it is difficult to 
automate this method for detecting large number of genes in a single 
format. 

1 0 Therefore, a need to develop nucleic acid detection and mapping 

methods amenable to high throughput formats. Thus, it is an object 
herein to provide a nucleic acid mutation detecting method that requires 
neither specific probes nor gel-electrophoresis. It is another object herein 
to provide a nucleic acid, mutation detecting method that is amendable to 

1 5 automation for simultaneous detection of large numbers of nucleic acid 
mutations. 
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SUMMARY 

Provided herein are nucleic acid mutation detecting methods that 
meet the above-noted objectives. These methods have wide application 
in various areas such as prognosis and diagnosis of diseases, disorders or 
5 pathological infections, and selectively binding, such as for removal or 
purification, nucleic acid duplexes that include abnormal base-pairings in a 
population of nucleic acid duplexes. 

The nucleic acid mutation detecting methods provided herein use 
mutant nucleic acid binding enzymes, such as mutant repair enzymes, 

10 and other enzymes that specifically bind to abnormal base pairs, such as 
base-pair mismatch, a base insertion, a base deletion and a pyrimidine 
dimer. The mutant enzymes substantially retain the specific binding 
affinities for abnormal base-pairings of the wild-type enzymes but have 
reduced or lack the catalytic activities. The mutant enzymes thus act like 

15 an antibody (herein designated a pseudo-antibody) and specifically bind to 
abnormal base-pairings in a duplex. The mutant enzymes are enzymes, 
such as repair enzymes, particularly DNA repair enzymes, that typically 
bind to a abnormally matched base pairs, such as base-pair mismatches, 
base insertions, a base deletions and pyrimidine dimers, and then 

20 catalytically repair the duplex. Methods of detection, diagnosis and 
other methods that rely on the affinity of the mutant enzymes for 
duplexes with abnormal base pairings, such as mismatches, are provided. 

Among the methods provided, are methods for identifying and 
25 quantifying mutations. These methods are based upon the specificity of 
the mutant enzyme for a particularly abnormal base pairing. Hybridizing 
perfectly matched nucleic acid strands, forms a nucleic acid duplex 
without any abnormal base-pairings and hybridizing imperfectly matched 
nucleic acid strands forms a nucleic acid duplex with one or more 
30 abnormal base-pairings. By contacting the formed nucleic acid duplex 
with one or more mutant repair enzyme(s), the duplex containing 
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abnormal base-pairing(s) binds to the mutant repair enzyme. Detection 
and quantitation of the complex formed between the nucleic acid duplex 
with th.e one or more abnormal base-pairings and the mutant DNA repair 
enzyme leads to identification and quantitation of nucleic^acid mutations. 
5 Hence, provided herein is a method for detecting abnormal base- 

pairing in a nucleic acid duplex by contacting a nucleic acid duplex having 
or suspected of having an abnormal base-pairing with a mutant DNA 
repair enzyme or complex thereof that has binding affinity for the * 
abnormal base-pairing in the duplex but has attenuated catalytic activity; 

10 and then detecting binding between the nucleic acid duplex and the 
mutant DNA repair enzyme or complex thereof. The amount of mutant 
enzyme bound is used to assess the presence or quantity of the abnormal 
base-pairing in the duplex. 

The nucleic acid duplex that is assayed includes DNA:DNA, 

15 DNA:RNA and RNA:RNA duplexes. Preferably, the nucleic acid duplex to 
be assayed is a DNA:DNA duplex. 

The abnormal base-pairing that is detected can be, for example, a 
base-pair mismatch, a base insertion, a base deletion or a pyrimidine 
dimer. Among the preferred uses of the mutant enzymes is for detection 

20 of a single base-pair mismatch. Such mismatches include, but are not 
limited to, A:A, A:C, A:G, C:C, C:T, G:G, G:T, T:T, C:U, G:U, J:U, U:U, 
5-formyluracil (fU):G, 7,8-dihydro-8-oxo-guanine {8-oxoG):C, 8-oxoG:A 
and any combination thereof. Also preferably, the base insertion or base 
deletion to be detected is a single base insertion or deletion. For 

25 example, the base insertion or base deletion resulting in a single-stranded 
loop containing about 1-5 bases or a loop containing rriore than 5 bases 
can be detected. 

Mutant DNA repair enzyme or complexes thereof that can be used 
in these methods include a mutant of any nucleic acid repair enzyme (or 

30 enzyme complex) as. long as the mutant retains its ability to specifically 
bind to the nucleic acid that the wild-type repairs, but lacks substantial 
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catalytic activity. Enzymatic systems capable of recognition and 
correction of base pairing errors within the DNA helix have been 
• demonstrated in bacteria, fungi and mammalian cells. Enzymes from any 
such system is contemplated herein. The enzyme can be mutageriized 
5 using standard procedures, either directed mutagenesis if the catalytic 
site is known, or systematic mutagenesis to empirically identify suitable 
mutations. The resulting enzymes are selected for their ability to bind to 
abnormally, such as mismatched, paired DNA but to not effect repair or 
catalytic activity. Exemplary enzymes include, but are not limited to, 

10 a mutant mutH, a mutant mutL, a mutant mutM, a mutant mutS, a 
mutant mutY; a mutant uvrD, a mutant dam, a mutant thymidine DNA 
glycosylase (TDG), a mutant mismatch-specific DNA glycosylase (MUG), 
a mutant AlkA, a mutant MLH1, a mutant MSH2, a mutant MSH3, a 
mutant MSH6, a mutant Exonuclease I, a mutant T4 endonuclease V, a 

15 mutant FEN1 {RAD27), a mutant DNA polymerase 6, a mutant DNA 
polymerase e, a mutant RPA, a mutant PCNA, a mutant RFC, aYnutant 
Exonuclease V, a mutant DNA polymerase III holoenzyme, a mutant DNA 
helicase, a mutant RecJ exonuclease, a cleavase and combinations 
thereof (see below for definitions of each enzyme). 

20 Also provided herein are methods for detecting a mutation in a 

nucleic acid. The methods are performed by hybridizing a strand of a 
nucleic acid having or suspected of having a mutation with a 
complementary strand of a wild-type nucleic acid, whereby if a mutation 
is present, the resulting duplex contains an abnormal base-pairing; 

25 contacting the resulting duplex with a mutant nucleic acid repair enzyme 
or complex thereof; and detecting binding between the nucleic acid 
duplex and the mutant nucleic acid repair enzyme or complex thereof. 
The amount of enzyme bound is used to assess the presence or quantity 
of the mutation. Depending upon the mutant enzyme selected, the 

30 identity of the mismatch may be determined as well. The nucleic acid 
strand to be tested and the complementary wild-type nucleic acid strand. 
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sclerosis (ALS), Angelman syndrome (AS), Charcot-Marle-tooth disease 
(CMT), epilepsy, tremor, fragile X syndrome, Friedreich's ataxia (FRDA), 
Huntington disease (HD), Niemann-Pick, Parkinson disease, Prader-Willi 
syndrome (PWS), spinocerebellar atrophy and Williams syndrome. 
5 Examples of signal diseases and disorders include, but are not limited to, 
ataxia telangiectasia (A-T), male pattern baldness, acne, hirsutism, 
Cockayne syndrome, glaucoma, mammals with abnormal secondary 
sexual characteristics, tuberous sclerosis, Wa.ardenburg syndrome (WS) 
arid Werner syndrome (WRN). 
10 Exemplary transporter diseases and disorders include, but are not 

limited to, cystic fibrosis (CF), diastrophic dysplasia (DTD), long-QT 
syndrome (LQTS), Menkes' syndrome, pendred syndrome, adult 
polycystic kidney disease (APKD), Wilson's disease and Zellweger 
syndrome. 

1 5 Other examples of the diseases and disorders that can be detected 

by the present methods include, but are not limited to, a disease or 
disorder associated with an androgen receptor mutation, tetrahydro- 
biopterin deficiencies, X-Linked agammaglobulinemia, a disease or 
disorder associated with a factor VII mutation, anemia, a disease or 

20 disorder associated with a glucose-6-phosphate mutation, the glycogen 
storage disease type II (Pompe Disease), hemophilia A, a disease or 
disorder associated with a hexosaminidase A mutation, a disease or 
disorder associated with a human type I or type III collagen mutation, a 
disease or disorder associated with a rhodopsin or RDS mutation, a 

25 disease or disorder associated with a LI CAM mutation, a disease or 
disorder associated with a LDL receptor mutation, a disease or disorder 
associated with an ornithine transcarbamylase mutation, a disease or 
disorder associated with a PAX6 mutation and a disease or disorder 
associated with a von Willebrand factor mutation, 

30 The methods herein can also be used to detect infections and 

pathogens associated therewith. Such infection include, but are not 
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limited to, infections caused by a virus, a eubacteria, an archaebacteria 
and a eukaryotic pathogen. The infections can be caused by a mutant 
strain of a virus, an eubacterra, an archaebacteria or an eukaryotic 
pathogen. 

5 Exemplary viruses include, but are not limited to, a Delta virus, a 

dsDNA virus, a retroid virus, a satellite virus, a ssDNA virus, a ssRNA 
negative-strand virus, ssRNA positive-strand virus {no DNA stage) and a 
bacteriophage. Eubacteria include, but are not limited to, a green 
bacteria, a flavobacteria, a spirochetes, a purple bacteria, a gram-positive 

10 bacteria, a gram-negative bacteria, a cynobacteria, a deinococci and a 
thermotogale. Archaebacteria include, but are not limited to, an extreme 
halophile, a methanogen and an extreme thermophile. Eukaryotic 
pathogens include, but are not limited to, a fungi such as a yeast, a 
ciliate, a cellular slime mode, a flagellate and a microsporidia. 

1 5 In the above methods for detecting mutations, the hybridization 

between the strand of a nucleic acid having or suspected of having a 
mutation and the complementary strand of a wild-type nucleic acid pan 
be facilitated by a recombinase. Recombinase, include, but are not 
limited to, Cre recombinase, RAG-1 V(D)J recombinase, Endonuclease II 

20 of coliphage T4 and Flp recombinase. 

Also provided herein are methods for detecting polymorphisms, 
including single nucleotide polymorphisms (SNPs) at a gene locus or loci. 
The methods include hybridizing a target strand of a nucleic acid molecule 
that includes the locus to be tested with a complementary reference 

25 strand of a nucleic acid that has a known .allele of the locus. Allelic 
identity between the target and the refei-ence strand results in the 
formation of a nucleic acid duplex without an abnormal base-pairing, and 
allelic difference between the target and the reference strands results in 
the formation of a nucleic acid duplex with an abnormal base-pairing. 

30 The resulting nucleic acid duplex formed is contacted with a mutant 

nucleic acid repair enzyme or complex thereof that has binding affinity for 

-10- 
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the abnormal base-pairing in the duplex but has attenuated catalytic 
activity. Binding between the nucleic acid duplex and the mutant DNA 
repair enzyme or complex thereof is detected. The presence of a 
polymorphism is then assessed. Any polymorphism may be detected by 
5 these methods, and include, but are not limited to, a variable nucleotide 
type polymorphism ("VNTR"), a single nucleotide polymorphism (SNP), 
preferably a human genome SNP. 

In the above methods for detecting polymorphisms, the 
hybridization between the target strand of a nucleic acid comprising a 
10 locus to be tested and the complementary reference strand of a nucleic 
acid comprising a known allele of the locus can be facilitated by a 
recombinase. Recombinases include, but are not limited to, Cre 
recombinase, RAG-1 V{D)J recombinase, Endonuclease II of coliphage T4 
or Flp recombinase. 

15 Methods for selecting, purifying or removing a nucleic acid duplex 

containing one or more abnormal base-pairings in a population of nucleic 
acid duplexes are also provided. These methods are performed by 
contacting a population of nucleic acid duplexes having or suspected of 
including an abnormal base-pairing with a mutant DNA repair enzyme or 
, 20 complex thereof, where the mutant DNA repair enzyme or complex 

thereof has binding affinity for the abnormal base-pairing in the duplex 
but has attenuated catalytic activity, whereby the nucleic acid duplex 
containing one or more abnormal base-pairing binds to the mutant DNA 
repair enzyme or complex thereof to form a binding complex. The 

25 resulting complex can be removed from the population. The mutant 
enzyme can be presented and introduced into the population on a solid 
support, whereby duplexes in the population that contain an abnormal 
base pairing to which the mutant enzyme binds will bind to the enzyme 
on the solid support. In a specific embodiment, the population of nucleic 

30 acid duplexes contains DNArDNA, DNA:RNA or RNArRNA duplexes. The 
abnormal base-pairing to be removed includes a base-pair mismatch, a 

-11- 
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base insertion, a base deletion or a pyrimidine dimer. Preferably, the 
base-pair mismatch to be removed is a single base-pair mismatch. 

The population of nucleic acid duplexes is produced by an 
amplification, such as by a polymerase chain reaction or a reaction using 
5 reverse transcription and subsequent DNA amplification of one or more 
expressed RNA sequences. 

Further provided herein are methods for detecting and localizing an 
abnormal base-pairing in a nucleic acid duplex. These methods are 
performed by contacting a nucleic acid' duplex having or suspected of 

1 0 having an abnormal base-pairing with a mutant DNA repair enzyme or 
complex thereof, where the mutant DNA repair enzyme or complex 
thereof has binding affinity for the abnormal base-pairing in the duplex 
but has attenuated catalytic activity, whereby the nucleic acid duplex 
containing an abnormal base-pairing binds to the mutant DNA repair 

1 5 enzyme or complex thereof to form a binding complex; subjecting the 
nucleic acid duplex to hydrolysis with an exonuclease under conditions 
such that the binding complex blocks hydrolysis; and then determining 
the location within the nucleic acid duplex protected from the hydrolysis, 
thereby detecting and localizing the abnormal base-pairing in the nucleic 

20 acid duplex. In a specific embodiment, the nucleic acid duplex to be 
assayed is a DNA:DNA, a DNA:RNA or a RNArRNA duplex. Preferably, 
the nucleic acid duplex to be assayed is a DNArDNA duplex. The 
abnormal base-pairing to be detected and localized is a base-pair 
mismatch, a base insertion, a base deletion or a pyrimidine dimer. 

25 Preferably, the base-pair mismatch to be detected and localized is a single 
base-pair mismatch. Exemplary exonucleases, include, but are not limited 
to, BAL-31 exonuclease, exonuclease III, Mung Bean exonuclease and 
Lambda exonuclease. 

In the above methods for detecting abnormal base-pairings, 

30 mutations, and polymorphisms, and the methods for localizing and 
removing abnormal base-pairings, the mutant DNA repair enzyme or 
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complex thereof can be labelled. Preferably, the mutant DNA repair 
enzyme or complex thereof used therein is labelled, with a detectable 
label, such as biotin, a bioluminescence generating reagent, such as a 
luciferin or luciferase, a fluorescence label or a radiolabel, and the binding 
5 between the abnormal base-pairing and the labelled mutant DNA repair 
enzyme or complex thereof is detected, such as with a streptavidin 
labeled enzyme, generation of bioluminescence by contacting with 
luciferin or luciferase, or detection of the fluorescence or bound 
radioactivity. Labeled enzymes, include but are not limited to, a 

10 peroxidase, a urease, an alkaline phosphatase, a luciferase and a 
glutathione S-transf erase. The mutant repair enzyme may also be 
prepared as a conjugate, such as a chemical conjugate or fusion protein, 
with a detectable label or tag or enzyme or enzyme substrate. 

In the above methods for detecting abnormal base-pairings, 

1 5 mutations, and polymorphisms, and the methods for localizing and 
removing abnormal base-pairings, the target nucleic acid strand to be 
assayed, the reference nucleic acid strand, the target nucleic acid duplex 
to be assayed, the nucleic acid duplex formed via hybridization of the 
target strand and the reference strand, or the mutant DNA repair enzyme 

20 or complex thereof can be immobilized on the surface of a support, either 
directly or indirectly, such as via a linker. Preferably, the support used is 
an insoluble support such as a silicon chip. Support geomatrices, 
include, but are not limited to, beads, pellets, disks, capillaries, hollow 
fibers, needles, solid fibers, random shapes, thin films, membranes and 

25 chips. Also more preferably, the nucleic acid strand, the nucleic acid 
duplex or the mutant DNA repair enzyme or complex thereof is 
immobilized in an array or a well format on the surface of a support. 
Immobilization can be effected via covalent, ionic or other interactions, 
and can be direct or via a suitable linking moiety, such as 

30 heterobifunctional linker. 

In the above methods, one sample can be assayed at one time, but 
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preferably, the assays are performed in high-throughput format where a 
plurality of samples are assayed simultaneously. 

In the above methods, the target nucleic acid strand or target 
nucleic acid duplex can be synthesized or derived from a natural source. 
5 In a specific embodiment, the target strand of a nucleic acid or the target 
nucleic acid duplex is isolated from a natural sample, e.g,, a biosample. 
Preferably, the sample is a body fluid or a biological tissue. More 
preferably, the body fluid is urine, blood, plasma, serum, saliva, semen, 
stool, sputum, cerebral spinal fluid, tears, mucus or amniotic fluid. Also 

10 more preferably, the biological tissue is connective tissue, epithelium 

tissue, muscle tissue, nerve tissue, organs, tumors, lymph nodes, arteries 
and individual cell(s). 

Mutant enzymes that substantially retain binding affinity and 
specificity, but that have reduced catalytic activity are also provided. 

1 5 Compositions containing the mutant enzymes, kits and articles of . 
manufacture containing the mutant enzymes are also provided. In 
particular a mutant nucleic repair enzyme that retains binding affinity for 
abnormal base pairs in a nucleic acid duplex, but has reduced catalytic 
activity compared to wild type, such that the mutant enzyme 

20 quantitatively retain a duplex on a solid support, with a Ka of at least 
about 10^ more preferably 10®, most preferably 10^ M or higher. 

The mutant enzymes include a mutant mutL is an E Co// mutant 
mutL having a mutation selected from E29K, E32K, A37T, D58N, G60S, 
G93D, R95C, G96S, G96D, S112L, A16T, A16V, P305L, H308Y, . 

25 G238D, S106F and A271V; a mutant MLH1 that is a human mutant 
MLH1 having a mutation selected from among of P28L, M35R, S44F, 
G67R, I68N, I107R, T117R, T117M, R265H, V185G and G224D; a 
mutant mutS that has a mutation in its catalytic site, dimerization site, 
mutL interaction site or combinations thereof; a mutM that has a 

30 mutation in its catalytic site, mutY interaction site or a combination 
thereof, including an E Co// mutant mutM having a K57G or K57R 

.14- 
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mutation; a mutant mutY that has a mutation in its catalytic site, mutM 
interaction site or a combination thereof, in an E.Coli mutant mutY having 
a mutation selected from among E37S, V45N, G1 16D, D138N and 
K142A; or is a mutant uvrD that has a mutation in its catalytic site, ATP 
5 binding site or a combination thereof, including an E.Coli mutant uvrD 
having a mutation selected from among K35M, D220NE221Q, E221Q 
and Q251E; a mutant IVISH2 that has a mutation in its catalytic site, ATP 
binding site, ATPase site or a combination thereof, including an S. 
cerevisiae mutant MSH2 having a G693D or a G855D mutation and a 

10 human mutant MSH2 having a fragment encoding 195 amino acids within 
the C-terminal domain of hMSH-2 or having a K675R mutation; a mutant 
MSH6 that has a mutation in its catalytic site, ATP binding site, ATPase 
site or any combination thereof, including a human mutant MSH6 having 
a K1140R mutation, a complex of a human mutant MSH2 having a 

15 K675R mutation and a human mutant MSH6 having a K1140R mutation; 
and a mutant T4 endonuclease V that has a E23Q mutation. 

Solid supports, such as silicon chips, containing one or a plurality 
of the same or of different mutant enzymes conjugated, either directly or 
indirectly, thereto, are aliso provided. 

20 Kits and articles of manufacture for detecting abnormal base- 

pairings, mutations, polymorphisms, and for localizing and/or removing 
abnormal base-pairings are provided herein. The combinations, kits and 
articles of manufacture typically include one or more of the mutant 
enzymes, which may be in a composition or provided in an array or in 

25 combination with a support with linked nucleic acids. 
DETAILED DESCRIPTION 
TABLE OF CONTENTS 

A. DEFINITIONS 

B. METHODS FOR DETECTING ABNORMAL BASE-PAIRING 
30 1 . Mutant DNA repair enzyme or complex thereof 

a. Nucleic adds encoding DNA repair enzymes 
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20 2. DETECTING THE BINDING OF THE MUTANT ENZYME 

C. METHODS FOR DETECTING MUTATIONS IN NUCLEIC ACIDS FOR 
PROGNOSIS AND DIAGNOSIS OF DISEASES, DISORDERS AND 
INFECTIONS 

1 . Cancer 

25 a. Breast cancer 

b. Burkitt lymphorna 

c. Colon cancer 

d. Small cell lung carcinoma 

e. Melanoma carcinoma 

30 f. Multiple endocrine neoplasia 

g. Neurofibromatosis 
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h. Cancer associated with p53 mutation 

i. Pancreatic carcinoma 
j. Prostate cancer 

Cancer associated with Ras oncogene 
I. Retinoblastoma 
m. Von-Hippel Lindau syndrome 
Immune system diseases and disorders 

a. Autoimmune polyglandular syndrome type I 

b. Inflammatory bowel disease 

c. DiGeorge syndrome 

d. Familial Mediterranean fever 

e. Severe combined immunodeficiency 
Metabolism system diseases and disorders 

a. Adrenoleukodystrophy 

b. Atherosclerosis 

c. Gaucher disease 

d. Gyrate atrophy of the choroid 

e. Diabetes 

f. Obesity 

g. Paroxysmal nocturnal hemoglobinuria 

h. Phenylketonuria 

i. Refsum disease 
J. Tangier disease 

Muscle and bone diseases and disorders 

a. Duchenne muscular dystrophy 

b. Ellis-Van Creveld syndrome 

c. Marfan syndrome 

d. Myotonic dystrophy 

Nervous system diseases and disorders 

a. Alzheimer disease 

b. Amyotrophic lateral sclerosis 
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c. 


Angelman syndrome 




d. 


Cliarcot-l\/lar!e-tootli disease 




e. 


Epilepsy 




f. 


Tremor 


5 


9- 


Fragile X syndrome 




h. 


Friedreich's ataxia 
Huntington disease 




J- 


Niemann-Picic 




k. 


Parlcinson disease 


10 


L 


Spinocerebellar atroptiy 




m. 


Williams syndrome 




6. Signal diseases and disorders 




a. 


Ataxia telangiectasia 




b. 


Ma\e pattern baldness, acne or hirsutism 


15 


c. 


Cockayne syndrome 




d. 


Glaucoma 




e. 


Abnormal secondary sexual characteristics 




f. 


Tuberous sclerosis 




Ii. 


Waardenburg syndrome 


20 


i. 


Werner syndrome 



7. Transporter diseases and disorders 





a. 


Cystic fibrosis 




b. 


Diastrophic dysplasia 




c. 


Long-QT syndrome 


25 


d. 


l\/lenkes' syndrome 




e. 


Pendred syndrome 




f. 


Adult polycystic kidney disease 




g- 


Wilson's disease 




h. 


Zellweger syndrome 


30 


8. Infections 



D. METHODS FOR DETECTING POLYMORPHISMS 
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E. METHODS FOR REMOVING NUCLEIC ACID DUPLEX WITH 
ABNORMAL BASE-PAIRING 

F. METHODS FOR DETECTING AND LOCALIZING ABNORMAL BASE- 
5 PAIRING IN NUCLEIC ACID DUPLEX 

G. LABELLING OF MUTANT DNA REPAIR ENZYMES 

1 . Conjugation 

a. Fusion proteins 
10 b. Chemical conjugation 

1) Heterobifunctional cross-linking reagents 

2) Exemplary Linkers 

a) Acid cleavable, photocleavable and heat 
sensitive linkers 

15 b) Other linkers for chemical conjugation 

c) Peptide linkers 

2. Selection of facilitating agents 

a. Protein binding moieties 

1 ) Interaction trap/two-hybrid system 
20 2) Phage-based expression cloning 

3) Detection of protein-protein interactions 

b. Epitope tags 

c. IgG binding proteins 

1) pEZZ 18 Protein A gene fusion vector 
25 2) pRIT2T Protein A gene fusion vector 

3) The IgG Sepharose 6 fast flow system 

d. )?-galactosidase fusion proteins 

e. Nucleic acid binding moieties 
1) DNA binding proteins 

30 2) RNA binding proteins 

3) Preparation of nucleic acid binding proteins 

4) Assays for identifying nucleic acid binding 
proteins 
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a) Mobility shift DNA-binding assay 
. b) Basic mobility shift assay procedure 

c) Competition mobility shift assay 

d) Antibody supershift assay 

5 e) Methylation and uracil interference assay 

1) Methylation interference assays 

2) Uracil interference assay 

3) DNase I footprint analysis 

4} Screening a yigtl 1 expression 
1 0 library with recognition-site DNA 

5) Rapid separation of protein-bound 
DNA from free DNA 

f . Lipid binding moieties 

g. Polysaccharide binding moieties 
15 h. Metal binding moieties 

i. Other facilitating agents 

1) Peroxidase 

2) urease 

3) Allcaline phosphatase 
20 4) Luciferase 

5) Glutathione S-transferase 

6) Defense proteins 

7) Fluorescent moieties 

H. IMMOBILIZATION OF MUTANT ENZYMES AND NUCLEIC ACIDS 

25 

1 . Immobilization of the mutant enzymes 

2. Immobilization of nucleic acids 

I. HIGH-THROUGHPUT ASSAY FORMAT 

1 . High-throughput assay instrumentation and capabilities 
30 2. Detection technologies 

a. Radiochemical methods 

b. Non-lsotopic detection methods 
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1 ) Colorimetry and luminescence 

2) Resonance energy transfer 

3) Time-resolved fluorescence 

4) CelUbased fluorescence assays 
5 5) Fluorescence polarization 

6) Fluorescence correlation spectroscopy 
3. Miniaturization 
J. SAMPLE COLLECTION 

K. COMBINATIONS, KITS AND ARTICLES OF MANUFACTURE 
10 A. DEFINITIONS 

Unless defined otherwise, all technical and scientific terms used 
herein have the same meaning as is commonly understood by one of 
ordinary skill in the art to which this invention belongs. All patents, 
applications, published applications and other publications and sequences - 

15 from GenBank and other data bases referred to herein are incorporated by 
reference in their entirety. 

As used herein, "base-pairing" refers to the specific hydrogen 
bonding between purines and pyrimidines in double-stranded nucleic 
acids. In DNA, the pairs are adenine (A) and thymine (T), and guanine (G) 

20 and cytosine (C), while in RNA they are adenine (A) and uracil (U), and 
guanine (G) and cytosine (C). Base-pairing leads to the formation of a 
nucleic acid double helix from two complementary single strands. 

As used herein, "nucleic acid duplex having abnormal base-pairing" 
refers to a nucleic acid duplex wherein there exists base-pair mismatch, 

25 i.e,, any base-pairing other than any of the normal A:T{U) and C;G pairs, 
a single-stranded loop region due to the addition of extra-nucleotide(s) in 
one strand and/or deletion of nucleotide(s) in the complementary strand, 
or a combination thereof. Non-limiting examples of base-pair mismatch 
include A:A, A:C, A:G, C:C, C:T, G:G, G:T, T:T, C:U, G:U, T:U, U:U, 5- 

30 formyluracil (fU):G, 7,8-dihydro-8-oxo-guanine (8-oxoG):C, 8-oxoG:A. 

As used herein, "enzyme" refers to a protein specialized to catalyze 
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or promote a specific metabolic reaction. Generally, enzymes are 
catalysts, but for purposes herein, such "enzymes" include those that 
would be modified during a reaction. Since the enzymes are modified to 
eliminate or substantially eliminate catalytic activity, they will not be so- 
5 modified during a reaction. 

As used herein, "DNA repair" refers to a process wherein the sites 
of mutations in DNA (DNA:DNA duplexes, DNA:RNA and, for purposes 
herein, also RNA:RNA duplexes) are recognized by a nuclease that 
excises the damaged or mutated region from the nucleic acid; and then 

10 further enzymes or enzymatic activities synthesize a replacement portion 
of a strand(s) so that the original sequence is preserved. 

As used herein, "DNA repair enzyme" refers to an enzyme that 
corrects errors in nucleic acid structure and sequence, i.e., recognizes, 
binds and corrects abnormal base-pairing in a nucleic acid duplex. DNA 

1 5 repair enzyme functions to protect genetic information against 

environmental damage and replication errors. Examples of DNA repair 
enzyme Include mutH, mutL, mutM, mutS, mutY, uvrD, dam, thymidine 
DNA glycosylase (TDG), mismatch-specific DNA glycosylase (MUG), 
AlkA, MLH1, MSH2, MSH3, MSH6, Exonuclease I, T4 endonuclease V, 

20 FEN1 (RAD27), DNA polymerase c5, DNA polymerase 6, RPA, PCNA and 
RFC. It is intended that DNA repair enzymes encompasses enzymes with • 
conservative amino acid substitutions that do not substantially alter repair 
activity. Suitable conservative substitutions of amino acids are known to 
those of skill in this art and may be made generally without altering the 

25 biological activity of the resulting molecule. Those of skill in this art 

redognize that, in general, single amino acid substitutions in non-essential 
regions of a polypeptide do not substantially alter biological activity (see, 
e.g. . Watson et aK Molecular Biology of the Gene, 4th Edition, 1 987, The 
Bejacmin/Cummings Pub. co., p. 224). 

30 Such substitutions are preferably made in accordance with those 

set forth in TABLE 1 as follows: 
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TABLE 1 



wiiyiriai resiiiuo 


uonservaiive suDSiiiiiiion 


Ala (A) 


Giy; Ser 


Arg (R) 


Lvs 


Asn (N) 


Gin; His 


Cys (C) 


Ser 


Gin (Q) 


Asn 


Glu (E) 


Asp 


Gly (G) 


Ala; Pro 


His (H) 


Asn; Gin 


lie (1) 


Leu; Val 


Leu (L) 


lie; Val 


Lys (K) 


Arg; Gin; Glu 


Met (M) 


Leu; Tyr; lie 


Phe IF) 


Met; Leu; Tyr 


Ser (S) 


Thr 


Thr (T) 


Ser 


Trp (W) 


Tyr 


Tyr (Y) 


Trp; Phe 


Val (V) 


lie; Leu 



Other substitutions are also permissible and may be determined 
empirically or in accord with known conservative substitutions. 

As used herein, the "amino acids," which occur in the various 
amino acid sequences appearing herein, are identified according to their 

25 well-known, three-letter or one-letter abbreviations. The nucleotides, 
which occur in the various DNA fragments, are designated with the 
standard single-letter designations used routinely in the art. 

As used herein, "a mutant DNA repair enzyme" (used 
interchangeably with "abnormal base-pairing trapping enzyme") refers to 

30 a mutant form of an enzyme that can repair errors in duplexes. The 
mutant, however, has binding affinity for the abnormal base-pairing in a 
nucleic acid duplex but lacks the catalytic activity whereby the abnormal 
pairing is excised. The mutant form of the repair enzyme that retains 
sufficient binding affinity for the abnormal base-pairing to be detected in 

35 the process or method, particularly assay, of interest. Typically this is at 
least about 10%, preferably at least about 50% binding affinity for the 
abnormal base-pairing, compared to its wildtype counterpart. Preferably, 
such mutant DNA repair enzyme retains 60%, 70%, 80%, 90%, 100% 
binding affinity for the abnormal base-pairing compared to its wildtype 
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counterpart, or has a higher binding affinity than its wildtype counterpart. 
Such mutant DNA repair enzynne is herein referred to as an "abnormal 
base-pairing trapping enzyme", i.e., a molecule that specifically binds to a 
selected abnormal base-pairing, but does not catalyze conversion thereof. 
5 The mutant enzyme possess substantially reduced such that the binding 
of the enzyme to the duplex can be detected. This is typically no more 
than about 50%, preferably no more than 20%, more preferably no more 
than about 10%, of the wild-type catalytic activity. 

As used herein the term "assessing" is intended to include 

10 quantitative and qualitative determination in the sense of obtaining an 
absolute value for the amount or concentration of the abnormal base- 
pairing present in the sample, and also of obtaining an index, ratio, 
percentage, visual or other value indicative of the level of abnormal base- 
pairing in the sample. Assessment may be direct or indirect and the 

1 5 chemical species actually detected need not of course be the abnormal 
base-pairing itself but may for example be a derivative thereof or some 
further substance. 

As used herein, "attenuated catalytic activity" refers to a mutant 
DNA repair enzyme that retains sufficiently reduced catalytic activity to 

20 be useful as a "pseudo-antibody", i.e., a molecule used in place of an 
antibody in immunoassay formats. The precise reduction in catalytic 
activity for use in the assays can be empirically determined for each 
assay. Typically, the enzyme will retain less than about 50% of one of 
its catalytic activities or less than 50% of its overall catalytic activities 

25 compared to its wildtype counterpart. Preferably, a mutant DNA repair 
enzyme retains less than 40%, 30%, 20%, 10%, 1%, 0.1%, or 0.01% 
of one of its catalytic activities or its overall catalytic activities compared 
to its wildtype counterpart. More preferably, a mutant DNA repair 
enzyme lacks detectable level of one of its catalytic activities or its overall 

30 catalytic activities compared to its wildtype counterpart. In instances in 
which catalytic activity is retained and/or a further reduction thereof is 
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desired, the contacting step can be effected in the presence of a catalysis 
inhibitor. Such inhibitors, include, but are not limited to, heavy metals, 
chelators or other agents that bind to a co-factor required for catalysis, 
but not for binding, and other such agents. 
5 As used herein, "mutH" refers to a procaryotic latent endonuclease 

that incises the transiently unmethylated strands of hemimethylated 5'- 
GATC-3' sequences. It is intended to encompass mutH with 
conservative amino acid. substitutions that do not substantially alter its 
activity, 

10 As used herein, "mutS" refers to a procaryotic DNA-mismatch. 

binding protein that can bind to a variety of mispaired bases and small (1- 
5 bases) single-stranded loops. It is intended to encompass mutS with 
conservative amino acid substitutions that do not substantially alter its 
activity. 

15 As used herein, "mutL" refers to a procaryotic protein that couples 

abnormal base-pairing recognition by mutS to mutH incision at the 5'- 
GATC-3' sequences in an ATP-dependent manner. It is intended to 
encompass mutL with conservative amino acid substitutions that do not 
substantially alter its activity. 

20 As used herein, "uvrD" refers to a procaryotic DNA helicase II that 

unwinds DNA in an ATP-dependent manner. It is intended to encompass 
uvrD with conservative amino acid substitutions that do not substantially 
alter its activity. 

As used herein, "dam" refers to a procaryotic adenine 

25 methyltransferases that plays a role in coordinating DNA replication 

initiation, DNA mismatch repair and the regulation of expression of some 
genes. It is intended to encompass dam with conservative amino acid 
substitutions that do not substantially alter its activity. 

As used herein, "mutM" refers to an 8-oxoguanine DNA 

30 glycosylase that rennoves 7,8-dihydro-8-oxoguanine (8-oxoG) and ' 
formamido pyrimidine (Fapy) lesions from DNA. It is Intended to 
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. encompass mutM vyith conservative amino acid substitutions that do not 
substantially alter its activity. 

As used herein, "rhutY" refers to an adenine glycosylase that is 
involved in the repair of 7;8-dihydro-8-oxo-2'-deoxyguanosine {OG):A and 
5 G:A mispairs in DNA. It is intended to encompass mutY with 

conservative amino acid substitutions that do not substantially alter its 
activity. 

As used herein, "TDG" refers to a thymine-DNA glycosylase that 
corrects G/T mispairs to G/C pairs. It is intended to encompass TDG with 
1 0 conservative amino acid substitutions that do not substantially alter Its 
activity. 

As used herein, "MUG" refers to a uracil-DNA glycosylase that 
corrects G/T and G/U mispairs to G/C pairs. It is intended to encompass 
MUG with conservative amino acid substitutions that do not substantially ' 

1 5 alter its activity. 

As used herein, "AlkA" refers to a 3-methyladenine DNA 
glycosylase II that corrects 5-formyluracil (fU)/G mispairs. It is intended 
to encompass AlkA with conservative amino acid substitutions that do 
not substantially alter its activity. 

ZO As usjsd herein, "MSH2" refers to the common component of the 

eukaryotic DNA repair complex MSH2-MSH6 (MutSa), which repairs 
base-base mispairs and insertion/deletion mispairs up to 12 unpaired 
bases, and the eukaryotic DNA repair complex MSH2-MSH3 (MutSI^), 
which repairs insertion/deletion mispairs having two or more unpaired 

25 bases but does not repair single'base insertion/deletion mispairs. As used 
herein, "MSH3" refers to the unique component of the "MSH2-MSH3" 
complex and "MSH6" refers to the unique component of the "MSH2- 
MSH6" complex. It is intended to encompass MSH2, MSH3 and MSH 6 
with conservative amino acid substitutions that do not substantially alter 

30 its respective activity. 

As used herein, "MLH1" and "PMS1" (PMS2 in humans) refers to 
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the components of the eukaryotic mutL-related protein complex, MLH1- 
PMS1, that interacts with IVISHZ-containing complexes bound to 
mispaired bases. It is intended to encompass MLH1 and PSM1 with 
conservative amino acid substitutions that do not substantially alter its 
5 respective activity. 

As used herein, "exonuclease I" refers to an eukaryotic 5'->3' 
exonuclease that has a preference for degrading double-stranded DNA. 
Exonuclease I involves in the DNA repair via its interaction with MSH2. It 
is intended to encompasis exonuclease ! with conservative amino acid 

10 substitutions that do not substantially alter its respective activity. 

As used herein, "T4 endonuclease V (EndoV)" refers to a base 
excision repair enzyme that removes thymine dimers (TD) from damaged 
DNA. It is intended to encompass T4 endonuclease V with conservative 
amino acid substitutions that do not substantially alter its respective 

15 activity. 

As used herein, "FENl (rad27)" refers to an evolutionarily 
conserved component of DNA replication complex. FENl processes 
Okazaki fragments during replication and is involved in base excision 
repair. FENl removes the last primer ribonucleotide on the lagging strand 

20 and it cleaves a 5' flap that may result from strand displacement during 
replication or during base excision repair. It is intended to encompass 
FENl (rad27) with conservative amino acid substitutions that do not 
substantially alter its respective activity. 

As used herein, "replication protein A (RPA)" refers to a 

25 heterotrimeric single-stranded DNA-binding protein that is highly 

conserved in eukaryotes. RPA plays essential roles in many aspects of 
nucleic acid metabolism, including DNA replication, nucleotide excision 
repair, and homologous recombination. It Is intended to encompass RPA 
with conservative amino acid substitutions that do not substantially alter 

30 its respective activity. 

As used herein, "proliferating cell nuclear antigen A (PCNA)" refers 
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to a DNA sliding clamp for DNA polymerase delta and is an essential 
component for eukaryotic chromosomal DNA replication. PCNA interacts 
with multiple partners, involved, for example, in Okazaki fragment joining, 
DNA repair, DNA methylation and chromatin assembly. PCNA is required 
5 for nucleotide excision repair, base excision repair and mismatch repair. 
DNA polymerases, RFC and PCNA recognize 3' ends of gaped DNA and 
fill the gaps by the same mechanism as used for joining of Okazaki 
fragments. It is intended to encompass PCNA with conservative amino 
acid substitutions that do not substantially alter its respective activity. 

10 - As used herein, "replication factor C (RFC)" refers to a five-subunit 
protein complex required for coordinate leading and lagging strand DNA 
synthesis during S phase and DNA repair in eukaryotic cells. RFC 
functions to load the proliferating cell nuclear antigen (PCNA), a 
processivity factor for polymerases delta and epsilon, onto primed DNA 

15 templates. This process, which is ATP-dependent, is carried out by 1) 
recognition of the primer terminus by RFC, 2) binding to and disruption of 
the PCNA trimer, and then 3) topologically linking the PCNA to the DNA. ^ 
It is intended to encompass RFC with conservative amino acid 
substitutions that do not substantially alter its respective activity. 

20 As used herein, "DNA polymerase e" refers to a mammalian DNA 

polymerase that has a tightly associated 3'-^5' exonuclease activity. 
DNA polymerase 6 is required at least for the repair synthesis of 
UV-damaged DNA. It is Intended to encompass DNA polymerase € with 
conservative amino acid substitutions that do not substantially alter its 

25 respective activity. 

As used herein, "DNA polymerase S" refers to a DNA polymerase 
that plays important roles in DNA replication, nucleotide excision repair, 
base excision repair and VDJ recombination. The function of DNA 
polymerase ,6 mgst be considered in the context of two other factors, 

30 PCNA and RFC, two protein complexes that build together the moving 
platform for DNA polymerase 6. This moving platform provides an 
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important framework for dynamic properties of an accurate DNA 
polymerase 6, such as its recruitment when its function is needed, the 
facilitation of DNA polymerase 6 binding to the primer terminus, the 
increase in DNA polymerase 6 processivity, the prevention of 
5 non-productive binding of the DNA polymerase 6 to single-stranded DNA, 
the release of DNA polymerase 6 after DNA synthesis and the bridging of 
DNA polymerase 6 interactions to other replication proteins. It is 
intended to encompass DNA polymerase 6 with conservative amino acid 
substitutions that do not substantially alter its respective activity. 

10 As used herein, "DNA polymerase III holoenzyme" refers to an 

enzyme that contains two DNA polymerases embedded in a particle with 
9 pther subunits. This multisubunit DNA polymerase is the E coli 
chronhosomal replicase, and it has several special features that distinguish 
it as a replicating machine. For example, one of its subunits is a circular • 

1 5 protein that slides along DNA while clamping the rest of the machinery to 
the template. Other subunits act together as a matchmaker to assemble 
the ring onto DNA. dverall, E coli DNA polymerase III holoenzyme is 
very similar in structure and function to the chromosomal replicases of 
eukaryotes, from yeast all the way up to humans., 

20 As used herein, "mutation" refers to change{s) in the nucleic acid 

length and/or sequence in an organism, which may arise in any of a 
variety of different ways, e.g., frame-shift mutation, non-sense mutation 
or missense mutation. 

As used herein, "disease or disorder" refers to a pathological 

25 condition in an organism resulting from, e.g., infection or genetic defect, 
and characterized by identifiable symptoms. 

As used herein, "cancer" refers to a pathological condition that 
* occurs when cell division gets out of control. Usually, the timing of cell 
division is under strict constraint, involving a network of signals that 

30 work together to say when a cell can divide, how often it should happen 
and how errors can be fixed. Mutations in one or more of the nodes in 
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this network can trigger cancer, be it through exposure to some 
environmental factor (e.g., tobacco smoke) or because of a genetic 
predisposition, or both. Usually, several cancer-promoting factors have to 
add up before a person will develop a malignant growth: with some 
5 exceptions, no one risk alone is sufficient. The predominant mechanisms 
for the cancers are (i) impairment of a DNA repair pathway (ii) the 
transformation of a normal gene into an oncogene and (iii) the 
malfunction of a tumor suppressor gene. 

As used herein, "an immune system disease or disorder" refers to a 

10 pathological condition caused by a defect in the immune system. The 
immune system is a complex and highly developed system, yet its 
mission is simple: to seek and kill invaders. If a person is born with a 
severely defective immune system, death from infection by a virus, 
bacterium, fungus or parasite will occur. In severe combined 

1 5 immunodeficiency, lack of an enzyme means that toxic waste builds up 
inside immune system cells, killing them and thus devastating the immune 
system. A lack of immune system cells Is also the basis for DiGeorge 
syndrome: improper development of the thymus gland means that T cell 
production is diminished. Most other immune disorders result from either 

20 an excessive immune response or an 'autoimmune attack'. For example, 
asthma, familial Mediterranean fever and Crohn disease (inflammatory 
bowel disease) all result from an over-reaction of the immune system, 
while autoimmune polyglandular syndrome and some facets of diabetes 
are due to the immune system attacking 'self cells and molecules. A key 

25 part of the immune system's role is to differentiate between invaders and 
the body's own cells - when it fails to make this distinction, a reaction 
against 'self cells and molecules causes autoimmune disease. 

As used herein, "a metabolism disease or disorder" refers to a 
pathological condition caused by errors in metabolic processes. 

30 Metabolism is the means by which the body derives energy and 

synthesizes the other molecules It needs from the fats, carbohydrates and 
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proteins we eat as food, by enzymatic reactions helped by minerals and 
vitamins. There is a significant level of tolerance of errors in the system: 
often, a mutation in one enzyme does not mean that the individual will 
suffer from a disease. A number of different enzymes may compete to 
5 modify the same molecule, and there may be more than one way to 
achieve the same end result for a variety of metabolic intermediates. 
Disease will only occur if a critical enzyme is disabled, or if a control 
mechanism for a metabolic pathway is affected. 

As used herein, "a muscle and bone disease or disorder" refers to a 

10 pathological condition caused by defects in genes important for the 

formation and function of muscles, and connective tissues. Connective 
tissue Is used herein as a broad term that includes bones, cartilage and 
tendons. For example, defects in fibrillin - a connective tissue proteins 
that is important in making the tissue strong yet flexible - cause Marfan 

15 syndrome, while diastrophic dysplasia is caused by a defect in a sulfate 
transporter found in cartilage. Two diseases that originate through a 
defect in the muscle cells themselves are Duchenne muscular dystrophy 
(DMD) and myotonic dystrophy (DM). DM is another 'dynamic mutation' 
disease, similar to Huntington disease, that involves the expansion of a 

20 nucleotide repeat, this time in a muscle protein kinase gene. DMD 
involves a defect in the cytoskeletal protein, dystrophin, which is 
important for maintaining cell structure. 

As used herein, "a nervous system disease or disorder" refers to a 
pathological condition caused by defects in the nervous system including 

25 the central nervous system, i.e., brain, and the peripheral nervous 
system. The brain and nervous system form an intricate network of 
electrical signals that are responsible for coordinating muscles, the 
senses, speech, memories, thought and emotion. Several diseases that 
directly affect the nervous system have a genetic component: some are 

30 due to a mutation in a single gene, others are proving to have a more 
complex mode of inheritance. As our understanding of the pathogenesis 
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of neurodegenerative disorders deepens, common themes begin to 
emerge: Alzheimer brain plaques and the inclusion bodies found in 
Parkinson disease contain at least one common component, while 
Huntington disease, fragile X syndrome and spinocerebellar atrophy are all 
5 'dynamic mutation' diseases in which there is an expansion of a DNA 
repeat sequence. Apoptosis is emerging as one of the molecular 
mechanisms invoked in several neurodegenerative diseases, as are other, 
specific, intracellular signaling events. The biosynthesis of myelin and the 
regulation of cholesterol traffic are also involved in Charcot-Marie-Tooth 

10 and Neimann-Pick disease, respectively. 

As used herein, "a signal disease or disorder" refers to a 
pathological condition caused by defects in the signal transduction 
process. Signal transduction within and between cells mean that they 
can communicate important information and act upon it. Hormones 

1 5 released from their site of synthesis carry a message to their target site, 
as in the case of leptin, >A(hich is released from adipose tissue (fat cells) 
and transported via the blood to the brain. Here, the leptin signals that 
enough has been eaten. Leptin binds to a receptor on the surface of 
hypothalamus cells, triggering subsequent intracellular signaling networks. 

20 Intracellular signaling defects account for several diseases, including 
cancers, ataxia telangiectasia and Cockayne syndrome. Faulty DNA 
repair mechanisms are also invoked in pathogenesis, since control of cell 
division, DNA synthesis and DNA repair all are inextricably linked. The 
end-result of many cell signals is to alter the expression of genes 

25 (transcription) by acting on DNA-binding proteins. Some diseases are the 
result of a lack of or a mutation in these proteins, which stop them from 
binding DNA in the normal way. Since signaling networks impinge on so 
many aspects of normal function, it is not surprising that so many 
diseases have at least some basis in a signaling defect. 

30 As used herein, "a transporter disease or disorder" refers to a 

pathological condition caused by defects in a transporter, channel or 
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pump. Transporters, channels or pumps that reside in cell membranes are 
key to maintaining the right balance of ions in cells, and are vital for 
transmitting signals from nerves to tissues. The consequences of defects 
in ion channels and transporters are diverse, depending on where they are 
5 located and what their cargo is. For example, in the heart, defects in 
potassium channels do not allow proper transmission of electrical 
impulses, resulting in the arrhythmia seen in long QT syndrome. In the 
lungs, failure of a sodium and chloride transporter found in epithelial cells 
leads to the congestion of cystic fibrosis, while one of the most common 

10 inherited forms of deafness, Pendred syndrome, looks to be associated 
with a defect in a sulphate transporter. 

As used herein, "virus" refers to obligate intracellular parasites of 
living but non-cellular nature, that contain DNA or RNA and a protein 
coat. Viruses range in diameter from about 20 to about 300 nm. Class I • 

1 5 viruses (Baltimore classification) have a double-stranded DNA as their 
genome; Class 11 viruses have a single-stranded DNA as their genome; 
Class III viruses have a double-stranded RNA as their genome; Class IV 
viruses have a positive single-stranded RNA as their genome, the genome 
itself acting as mRNA; Class V viruses have a negative single-stranded 

20 RNA as their genome used as a template for mRNA synthesis; and Class 
VI viruses have a positive single-stranded RNA genome but with a DNA 
intermediate not only in replication but also in mRNA synthesis. The 
majority of viruses are recognized by the diseases they cause in plants, 
animals and prokaryotes. Viruses of prokaryotes are known as 

25 bacteriophages. 

As used herein, "bacteria" refers to small prokaryoticorganisms 
(linear dimensions of around 1 //m) with non-compartmentalized circular 
DNA and ribosomes of about 70S. Bacteria protein synthesis differs from • 
that of eukaryotes. Many anti-bacterial antibiotics interfere with bacteria 

30 proteins synthesis but do not affect the infected host. 

As used herein, "eubacteria" refers to a major subdivision of the 
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bacteria except the archaebacteria. Most Gram-positive bacteria, 
cyanobacteria, mycoplasmas, enterobacteria, pseudomonas and 
chloroqlasts are eubacteria. The cytoplasmic membrane of eubacteria 
contains ester-linked lipids; there is peptidoglycan in the cell wall (if 
5 present); and no introns have been discovered in eubacteria. 

As used herein, "archaebacteria" refers to a major subdivision of 
the bacteria except the eubacteria. There are 3 main orders of 
archaebacteria: extreme halophiles, methanogens and sulphur-dependent 
extreme thermophiles. Archaebacteria differs from eubacteria in 

10 ribosomal structure, the possession (in some case) of introns, and other 
features including membrane composition. 

As used herein, "locus" refers to the site in linkage miap or on a 
chromosome where the nucleic acid sequence, e.g., gene, for a particular 
trait is located. Any one of the alleles of a sequence may be present at 

1 5 this site. 

As used herein, "an allele" refers to one of any different forms or 
variants of a gene found at the same place, or a locus, on a chromosome. 

As used herein, "polymorphism" refers to the existence, in a 
20 population, of two or more alleles of a nucleic acid sequence, e.g., gene, 
where the frequency of the rarer alleles is greater than can be explained 
by recurrent mutation alone (typically greater than 1 %). 

As used herein, "variable nucleotide type polymorphism ("VNTR")" 
refers to polymorphisms arising from spontaneous tandem duplications . 
25 of di- or trinucleotide repeated motifs of nucleotides. 

As used herein, "single nucleotide polymorphism ("SNP")" refers to 
polymorphisms arising froni the replacement of only a single nucleotide 
from the initially present gene sequence. 

As used herein, "enzymatic amplification" refers to an enzyme- 
30 catalyzed reaction by which nucleic acid, e.g., DNA, molecules are 
amplified. Examples of such reactions include the polymerase chain 
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reaction and reactions utilizing reverse transcription and subsequent DNA 
amplification of one or more expressed RNA sequences. 

As used herein, "exonuclease" refers to an enzyme that cleaves 
nucleotides one at time from the end of a polynucleotide chain. 
5 Exonuclease may be specific for either 5' or 3' end of DNA or RNA. If 
protein is bound to the nucleic acid, exonuclease cleavage stops when 
the exonuclease encounters the protein. 

As used herein, "recombinase" refers to an enzyme that catalyzes 
the inter-molecular formation of a nucleic acid duplex from 

10 single-stranded nucleic acids obtained from different sources, by a 

renaturation reaction. Such a recombinase is also capable of catalyzing a 
strand transfer reaction between a single-stranded nucleic acid from one 
source and double-stranded nucleic acid obtained from a different source. 
As used herein, "serum" refers to the fluid portion of the blood 

1 5 obtained after removal of the fibrin clot and blood cells, distinguished 
from the plasma in circulating blood. 

As used herein, "plasma" refers to the fluid, noncellular portion of 
the blood/ distinguished from the serum obtained after coagulation. 
As used herein, "substantially pure" means sufficiently 

20 homogeneous to appear free of readily detectable impurities as 
determined by standard methods of analysis, such as thin layer 
chromatography (TLC), gel electrophoresis and high performance liquid 
chromatography (HPLC), used by those of skill in the art to assess such 
purity, or sufficiently pure such that further purification would not 

25 detectably alter the physical and chemical properties, such as enzymatic 
and biological activities,. of the substance. Methods for purification of the 
compounds to produce substantially chemically pure compounds are 
known to those of skill in the art. A substantially chemically pure 
compound may, however, be a mixture of stereoisomers or isomers. In 

30 such instances, further purification might increase the specific activity of 
the compound. 
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As used herein, "biological activity" refers to the in vivo activities 
of a connpound or physiological responses that result upon in vivo 
administration of a compound, composition or other mixture. Biological 
activity, thus, encompasses therapeutic effects and pharmaceutical 
5 activity of such compounds, compositions and mixtures. Biological 
activities may be observed in vitro systems designed to test or use such 
activities. Thus, for purposes herein the biological activity of a luciferase 
is its oxygenase activity whereby, upon oxidation of a substrate, light is 
produced. 

10 ' As used herein, a "receptor" refers to a molecule that has an 
affinity for a given ligand. Receptors may be naturally-occurring or 
synthetic molecules. Receptors may also be referred to in the art as anti- 
ligands. As used herein, the receptor and anti-ligand are interchangeable. 
Receptors can be used in their unaltered state or as aggregates with other • 

15 species. Receptors may be attached, covalently or noncovalently, or in 
physical contact with, to a binding member, either directly or indirectly 
via a specific binding substance or linker. Examples of receptors, include, 
but are not limited to: antibodies, cell membrane receptors surface 
receptors and internalizing receptors, monoclonal antibodies and antisera 

20 reactive with specific antigenic determinants [such as on viruses, cells, or 
other materials], drugs, polynucleotides, nucleic acids, peptides, 
cofactors, lectins, sugars, polysaccharides, cells, cellular membranes, and 
organelles. 

Examples of receptors and applications using such receptors, 
25 include but are not restricted to: 

a) enzymes: specific transport proteins or enzymes essential to 
survival of microorganisms, which could serve as targets for antibiotic 
[ligand] selection; 

b) antibodies: identification of a ligand-binding site on the antibody 
30 molecule that combines with the epitope of an antigen of interest may be 

investigated; determination of a sequence that mimics an antigenic 

-36- 



wo 01/62968 



PCTAJSOl/00452 



epitope may lead to the development of vaccines of which the 
immunogen is based on one or more of such sequences or lead to the 
development of related diagnostic agents or compounds useful in 
therapeutic treatments such as for auto-immune diseases 
5 c) nucleic acids: identification of ligand, such as protein or RNA, 

binding sites; 

d) catalytic polypeptides: polymers, preferably polypeptides, that 
are capable of promoting a chemical reaction involving the conversion of 
one or more reactants to one or more products; such polypeptides 

10 generally include a binding site specific for at least one reactant or 

reaction intermediate and an active functionality proximate to the binding 
site, in which the functionality is capable of chemically modifying the 
bound reactant [see, e.g. , U.S. Patent No. 5,215,899]; 

e) hormone receptors: determination of the ligands that bind with • 
1 5 high affinity to a receptor is useful in the development of hormone 

replacement therapies; for example, identification of ligands that bind to 
such receptors may lead to the development of drugs to control blood 
pressure; and 

f) opiate receptors: determination of ligands that bind to the opiate 
20 receptors in the brain is useful in the development of less-addictive 

replacements for morphine and related drugs. 

As used herein, "antibody" includes antibody fragments, such as 

Fab fragments, which are composed of a light chain and the variable 

region of a heavy chain. 
25 As used herein, "humanized antibodies" refer to antibodies that are 

modified to include "human" sequences of amino acids so that 

administration to a human will not provoke an immune response. 

Methods for preparation of such antibodies are known. For example, the 

hybridoma that expresses the monoclonal antibody is altered by 
30 recombinant DNA techniques to express an antibody in which the amino 

acid composition of the non-variable regions is based oh human 
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antibodies. Computer programs have been designed to identify. such 
regions. 

As used herein, "production by recombinant means" refers to 
production methods that use recombinant nucleic acid methods that rely 
5 on well known methods of molecular biology for expressing proteins 
encoded by cloned nucleic acids. 

As used herein, "substantially identical" to a product means 
sufficiently similar so that the property of interest is sufficiently 
unchanged so that the substantially identical product can be used in place 
10 of the product. 

As used herein, "equivalent," when referring to two sequences of 
nucleic acids means that the two sequences in question encode the same 
sequence of amino acids or equivalent proteins. It also encompasses 
those that hybridize under conditions of moderate, preferably high 
1 5 stringency, whereby the. encoded protein retains desired properties. 

As used herein, when "equivalent" is used in referring to two 
proteins or peptides, it means that the two proteins or peptides have 
. substantially the same amino acid sequence with only conservative amino 
acid substitutions (see, e.g., Table 1, above) that do not substantially 
20 alter the activity or function of the protein or peptide. 

When "equivalent" refers to a property, the property does not 
need to be present to the same extent [ e.g. , two peptides can exhibit 
different rates of the same type of enzymatic activity], but the activities 
are preferably substantially the same. "Complementary," when referring 
25 to two nucleic acid molecules, means that the two sequences of 

nucleotides are capable of hybridizing, preferably with less than 25%, 
more preferably with less than 15%, even more preferably with less than 
5%, most preferably with no mismatches between opposed nucleotides. 
Preferably the two molecules will hybridize under conditions of high 
30 stringency. 
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As used herein: "stringency of liybridization" in determining 
percentage misnnatch is as follows: 

1) high stringency: 0.1 x SSPE, 0.1% SDS, 65^C; 

2) nnedium stringency: 0.2 x SSPE, 0.1% SDS, 50°C (also 
5 referred to as moderate stringency); and 

3) low stringency: 1 ,0 x SSPE, 0,1 % SDS, 50°C. 

It is understood that equivalent stringencies may be achieved using 
alternative buffers, salts and temperatures. 

The term "substantially" identical or homologous or similar varies 
10 with the context as understood by those skilled in the relevant art and 
generally means at least 70%, preferably means at least 80%, rnore 
preferably at least 90%, and most preferably at least 95% identity. 

As used herein, a "composition" refers to a any mixture of two or 
more products or compounds. It may be a solution, a suspension, liquid, - 
1 5 powder, a paste, aqueous, non-aqueous or any combination thereof. 

As used herein, a "combination" refers to any association between 
two or among more items. 

As used herein, "fluid" refers to any composition that can flow. 
Fluids thus encompass compositions that are in the form of semi-solids, 
20 pastes, solutions, aqueous mixtures, gels, lotions, creams and other such 
compositions. 

As used herein, "vector (or plasmid)" refers to discrete elements 
that are used to introduce heterologous DNA into cells for either 
expression or replication thereof. Selection and use of such vehicles are 

25 well known within the skill of the artisan. An expression vector includes 
vectors capable of expressing DNAs that are operatively linked with 
regulatory sequences, such as promoter regions, that are capable of 
effecting expression of such DNA fragments. Thus, an expression vector 
refers to a recombinant DNA or RNA construct, such as a plasmid, a 

30 phage, recombinant virus or other vector that, upon introduction into an 
appropriate host cell, results in expression of the cloned DNA. 
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Appropriate expression vectors are well known to those of skill in the art 
and include those that are replicable in eukaryotic cells and/or prokaryotic 
cells and those that remain episomal or those which integrate into the 
host cell genome. 

5 As used herein, "a promoter region or promoter element" refers to 

a segment of DNA or RNA that controls transcription of the DNA or RNA 
to which it is operatively linked. The promoter region includes specific 
sequences that are sufficient for RNA polymerase recognition, binding 
and transcription initiation. This portion of the promoter region is referred 

10 to as the promoter. In addition, the promoter region includes sequences 
that modulate this recognition, binding and transcription initiation activity 
of RNA polymerase. These sequences may be cis acting or may be 
responsive to trans acting factors. Promoters, depending upon the nature 
of the regulation, may be constitutive or regulated. Exemplary promoters - 

15 contemplated for use in prokaryotes include the bacteriophage T7 and T3 
promoters, and the like. 

As used herein, "operatively linked or operationally associated" 
refers to the functional relationship of DNA with regulatory and effector 
sequences of nucleotides, such as promoters, enhancers, transcriptional 

20 and translational stop sites, and other signal sequences. For example, 
operative linkage of DNA to a promoter refers to the physical and 
functional relationship between the DNA and the promoter such that the 
transcription of such DNA is initiated from the promoter by an RNA 
polymerase that specifically recpgnizes, binds to and transcribes the 

25 DNA. In order to optimize expression and/or in vitro transcription, it may 
be necessary to remove, add or alter 5' untranslated portions of the 
clones to eliminate extra, potential inappropriate alternative translation 
initiation {i.e., start) codons or other sequences that may interfere with or 
reduce expression, either at the level of transcription or translation. 

30 Alternatively, consensus ribosome binding sites (see, e.g.. Kozak, J. Biol. 
Ctiem., 266:19867-19870 (1991)) can be inserted immediately 5' of the 
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Start codon and may enhance expression. The desirability of (or need for) 

such modification may be empirically determined. 

As used herein, "sample" refers to anything which may contain an 

analyte for which an analyte assay is desired. The sample may be a 
5 biological sample, such as a biological fluid or a biological tissue. 

Examples of biological fluids include urine, blood, plasma, serum, saliva, 

semen, stool, sputum, cerebral spinal fluid, tears, mucus, amniotic fluid 

or the like. Biological tissues are aggregates of cells, usually of a 

particular kind together with their intercellular substance that form one of 
10 the structural materials of a human, animal, plant, bacterial, fungal or viral 

structure, including connective, epithelium, muscle and nerve tissues. 

Examples of biological tissues also include organs, tumors, lymph nodes, 

arteries and individual cell(s). 

As used herein, "replication" refers to a process of DNA-dependent - 
15 DNA synthesis wherein the DNA molecule is duplicated to give identical 

copies. 

As used herein, "transcription" refers to a process of DNA- 
dependent RNA synthesis. 

As used herein, "recombination" refers to a reaction between 

20 homologous sequences of DNA. The critical feature is that the enzymes 
responsible for recombination can use any pair of homologous sequences 
as substrates, although some types of sequences may be favored over 
others. Recombination allows favorable or unfavorable mutations to be 
separated and tested as individual units in new assortments. 

25 As used herein, "DNA structure maintenance" refers to DNA 

sequences, through binding to proteins, that maintain the DNA molecule 
in particular structures such as chromatids, chromatins or chromosomes. 

As used herein, "DNA polymerase" refers to an enzyme that 
synthesizes DNA using a DnX as the template. It is intended to 

30 encompass DNA polymerase with conservative amino acid substitutions 
that do not substantially alter its activity. 
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As used herein, "DNA-dependent RNA polymerase" or 
"transcriptase" refers to an enzyme that synthesizes RNA using a DNA as 
the template. It Is intended to encompass DNA-dependent RNA 
polymerase with conservative amino acid substitutions that do not 
5 substantially alter its activity. 

As used herein, "DNAase" refers to an enzyme that attacks bonds 
in DNA. It is intended to encompass DNAase with conservative amino 
acid substitutions that do not substantially alter its activity. 

As used herein, "DNA ligase" refers to an enzyme that catalyses 
1 0 the formation of a phosphodiester bond to link two adjacent bases 

separated by a nick in one strand of double helix of DNA. It is intended 
to encompass DNA ligase with conservative amino acid substitutions that 
do not substantially alter its activity. 

As used herein, "DNA topoisomerase" refers to an enzyme that can - 
1 5 change the linking number of DNA. It is intended to encompass DNA 
topoisomerase with conservative amino acid substitutions that do not 
substantially alter its activity. 

As used herein,. "DNA transposase" refers to an enzyme that is 
involved in insertion of a transpQson at a new site. It is intended to 
20 encompass DNA transposase with conservative amino acid substitutions 
that do not substantially alter its activity. 

As used herein, "Transposon" refers to a DNA sequence that is 
able to replicate and insert one copy at a new location in the genome. 

As used herein, "DNA kinase" refers to an enzyme that 
25 phosphorylates DNA. It is intended to encompass DNA kinase with 
conservative amino acid substitutions that do not substantially alter its 
activity. 

As used herein, "restriction enzyme" refers to an enzyme that 
recognizes specific short sequences of DNA and cleaves the duplex at the 
30 recognition site or other site. It is intended to encompass a restriction 
enzyme with conservative amino acid substitutions that do not 
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substantially alter its activity. 

As used herein, "rRNA" or "ribosomal RNA" refers to the RNA 
components of the ribosome, a compact ribonucleoprotein particle that 
assembles amino acids into proteins. 
5 As used herein, "mRNA" or "messenger RNA" refers to the RNA 

molecule that bears the same sequence of the DNA coding strand and is 
used as the template in protein synthesis. 

As used herein, "tRNA" or "transfer RNA" refers to the RNA 
molecule that carries amino acids to the ribosome for protein synthesis. 
10 As used herein, "reverse transcription" refers to the RNA- 

dependent DNA synthesis. 

As used herein, "RNA splicing" refers to the removal of introns and 
joining of axons in RNA so that introns are spliced out and axons are 
spliced together. 

15 As used herein, "RNA-dependent DNA polymerase" or "reverse 

transcriptase" refers to an enzyme that synthesizes DNA using a RNA as 
the template. It is intended to encompass a RNA-dependent DNA 
polymerase with conservative amino acid substitutions that do not 
substantially alter its activity. 

20 As used herein, "RNA-dependent RNA polymerase" refers to an 

enzyme that synthesizes RNA using a RNA as the template. It is intended 
to encompass a RNA-dependent RNA polymerase with conservative 
amino acid substitutions that do not substantially alter its activity. 

As used herein, "RNA ligase" refers to an enzyme that catalyses 

25 the formation of a phosphodiester bond to link two adjacent bases 

separated by a nick in one strand of RNA. It is intended to encompass a 
RNA ligase with conservative amino acid substitutions that do not 
substantially alter its activity. 

As used herein, "RNA maturase" refers to an enzyme that 

30 catalyses the removal of intron in the RNA splicing. It is Intended to 
encompass a RNA maturase with conservative amino acid substitutions 
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that do not substantially alter its activity. 

As used herein, "luminescence" refers to the detectable EM 
radiation, generally, UV, IR or visible EM radiation that is produced when 
the excited product of an exergic chemical process reverts to its ground 
5 state with the emission of light. Chemiluminescence is luminescence that 
results from a chemical reaction. Bioluminescence is chemiluminescence 
that results from a chemical reaction using biological molecules or 
synthetic versions or analogs thereof as substrates and/or enzymes. 
As used herein, "bioluminescence," which is a type of 

10 chemiluminescence, refers to the emission of light by biological 
molecules, particularly proteins. The essential condition for 
bioluminescence is molecular oxygen, either bound or free in the presence 
of an oxygenase, a luciferase, which acts on a substrate, a luciferin. 
Bioluminescence is generated by an enzyme or other protein (luciferase) 

1 5 that is an oxygenase that acts on a substrate luciferin (a bioluminescence 
substrate) in the presence of molecular oxygen and transforms the 
substrate to an excited state, which upon return to a lower energy level 
releases the energy in the form of light. 

As used herein, the substrates and enzymes for producing 

20 bioluminescence are generically referred to as luciferin and luciferase, 
respectively. When reference is made to a particular species thereof, for 
clarity, each generic term is used with the name of the organism from 
which it derives, for example, bacterial luciferin or firefly luciferase. 

As used herein, "luciferase" refers to oxygenases that catalyze a 

25 light emitting reaction. For instance, bacterial lucif erases catalyze the 
oxidation of flavin rhpnonucleotide [FMN] and aliphatic aldehydes, which 
reaction produces light. Another class of luciferases, found among 
marine arthropods, catalyzes the oxidation of Cypridina [Vargula] 
luciferin, and another class of lucif erases catalyzes the oxidation of 

30 Coleoptera luciferin. 
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Thus, lucif erase refers to an enzyme or photoprotein that catalyzes 
a bioluminescent reaction [a reaction that produces bioluminescence]. 
The luciferases, such as firefly and Renilla luciferases, that are enzymes 
which act catalytically and are unchanged during the bioluminescence 
5 generating reaction. The luciferase photoproteins, such as the aequorin 
photoprotein to which luciferin is non-covalently bound, are changed, 
such as by release of the luciferin, during bioluminescence generating 
reaction. The luciferase is a protein that occurs naturally in an organism 
or a variant or mutant thereof, such as a variant produced by 

1 0 mutagenesis that has one or more properties, such as thermal stability, 
that differ from the naturally-occurring protein. Luciferases and modified 
mutant or variant forms thereof are well known. For purposes herein, 
reference to luciferase refers to either the photoproteins or luciferases. 
As used herein, "peroxidase" refers to an enzyme that catalyses a 

1 5 host of reactions in which hydrogen peroxide is a specific oxidizing agent 
and a wide range of substrates act as electron donors. It is intended to 
encompass a peroxidase with conservative amino acid substitutions that 
do not substantially alter its activity. Peroxidases are widely distributed 
in nature and are produced by a wide variety of plant species. The chief 

20 commercially available peroxidase is horseradish peroxidase. 

As used herein, "urease" refers to an enzyme that catalyses 
decomposition of urea to form ammonia and carbon dioxide. It is 
intended to encompass an urease with conservative amino acid 
substitutions that do not substantially alter its activity. Urease is widely 

25 found in plants, animals and microorganisms. 

As used herein, ''alkaline phosphatases" refers to a family of 
functionally related enzymes named after the tissues in which they 
predominately appear. Alkaline phosphatases carry out 
hydrolase/transferase reactions on phosphate-containing substrates at a 

30 high pH optimum. It is intended to encompass an alkaline phosphatases . 
with conservative amino acid substitutions that do not substantially alter 
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its activity. 

As used herein, "glutathione S-transferase" refers to a ubiquitous 
family of enzymes with dual substrate specificities that perform important 
biochemical functions of xenobiotic biotransformation and detoxification, 
5 drug metabolism, and protection of tissues against peroxidative damage. 
The basic reaction catalyzed by glutathione S-transferase is the 
conjugation of an electrophile with reduced glutathione (GSH) and results 
in either activation or deactivatiorl/detoxification of the chemical. It is 
intended to encompass a glutathione S-transferase with conservative 

10 amino acid substitutions that do not substantially alter its activity. 

As used herein, high-throughput screening (HTS) refers to 
processes that test a large number of samples, such as samples of 
diverse chemical structures against disease targets to identify "hits" (see, 
e.g., Broach et al. High throughput screening for drug discovery. Nature, - 

15 554:14-16 (1996); Janzen, etaL High throughput screening as a 

discovery tool in the pharmaceutical industry, Lab Robotics Automation: 
5261-265 (1996); Fernandes, P.B., Letter from the society president, J. 
BiomoL Screening, 2'A (1997); Burbaum, efaA, New technologies for 
high-throughput screening, Curr. Opin. Chem. Biol,, 7:72-78 (1997)]. 

20 HTS operations are highly automated and computerized to handle sample 
preparation, assay procedures and the subsequent processing of large 
volumes of data. 

As used herein, the abbreviations for any protective groups, amino 
acids and other, compounds, are, unless indicated otherwise, in accord 
25 with their common usage, recognized abbreviations, or the lUPAC-lUB 
Commission on Biochemical Nomenclature (see, (1972) Biochem, 
11:1726). 

For clarity of disclosure, and not by way of limitation, the detailed 
description is divided into the subsections that follow. 
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B. METHODS FOR DETECTING ABNORMAL BASE-PAIRING 

Provided herein are methods for detecting abnormal base-pairing in 
a nucleic acid duplex. Detection of abnormal base pairing has numerous 
applications, such as in diagnostics, mutational analyses and 
5 polymorphism identification. The method involves binding a mutant 

enzyme that specifically binds to mismatched base pairs in a DNA duplex, 
DNA:RNA duplex, or RNA:RNA duplex, and detecting such binding, which 
can be quantitative. By virtue of the base specificity of the certain 
enzymes the identity of the abnormal base pairing may be determined. 

10 The reactions can. be performed in various formats, including 

solution and solid phase reactions. Solid supports to which nucleic acid 
or enzyme is bound.. In addition, the resulting complexes of enzyme 
bound to nucleic acid can be captured on solid supports by virtue of 
interaction of the nucleic acid with other nucleic acids on the supports or 

1 5 the enzyme with moieties on the supports. 

The preferred forrinats herein are those that are amenable to high 
throughput analyses, such as chip-based reactions in which nucleic acid 
probes of known sequence are arranged, such as in an array on a 
support, and reacted with a sample, such as nucleic acid from a body 

20 fluid or tissue. 

In a particular embodiment, the method is performed by 
contacting a nucleic acid duplex having or suspected of having an 
abnormal base-pairing with a mutant DNA repair enzyme or complex 
thereof, where the mutant DNA repair enzyme or complex thereof has 

25 binding affinity for the abnormal base-pairing in the duplex but has 
attenuated catalytic activity; and then detecting binding between the 
nucleic acid duplex and the mutant DNA repair enzyme or complex 
thereof, whereby the presence or quantity of the abnormal base-pairing in 
the duplex is assessed. 

30 As noted, the nucleic acid duplex to be assayed is a DNA:DNA, a 

DNA:RNA or a RNA:RNA duplex. Preferably, the nucleic acid duplex to 
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be assayed is a DNA:DNA duplex. The abnormal base-pairing to be 
detected includes a base-pair mismatch, a base insertion, a base deletion 
and a fDyrimidine dimer. Preferably, the base-pair mismatch to be 
detected is a single base7pair mismatch. Non-limiting examples of the 
5 base-pair mismatch that can be detected include A:A, A:C, A:G, C:C, 
C:T, G:G, G:T, T:T, C:U, G:U, T:U, U:U, 5-formyluracil (fU):G, 
7,8-dihydro-8-oxo-guanine (8-oxoG):C, 8-oxoG:A or a combination 
thereof. Also preferably, the base insertion or base deletion to be 
detected is a single base insertion or deletion. For example, the base 

10 insertion or base deletion resulting in a single-stranded loop containing 
about 1-5 bases or a loop containing more than 5 bases can be detected. 
1. MUTANT DNA REPAIR ENZYME OR COMPLEX THEREOF 
Any mutant DNA repair enzyme or complex thereof that has 
binding affinity for the abnormal base-pairing in the duplex but has 

1 5 attenuated catalytic activity can be used in the present methods. Such 
enzymes may be prepared by mutagenensis of nucleic acids encoding the 
enzyme and selection of the expressed protein for the requisite binding 
properties and reduced or absent catalytic activities. 

Mutant enzymes having the desired specificity can be prepared 

20 using routine mutagenesis methods. Residues to mutate can be identified 
by systematically mutating residues to different residues, and identifying 
those that have the desired reduction in catalytic activity and retention of 
binding activity for a particular abnormal base-pairing. Alternatively or 
additionally, mutations may be based upon predicted or known 3-D 

25 structures of enzymes, including predicted affects of various mutations 
(see, e.g., Turner et al. (1998) Nature Structural Biol. 5:369-376; Ault- 
Richi6 etal. (1994) J. Biol. Chem. 269 :31472-31478; Yuan et al. 

(1 996) J. Biol. Chem. 271:28009-2801 6; Williams et al. (1 998) 
Biochemistrv 37:7096: Steadman ef a/. (1998) Biochemistry 37:7089- 

30 7095; Finer-Moore ef a/. (1998) J. MoL Biol. 276 :113-129; Strop etal. 

(1997) Protein Sci. 6:2504-2511; Finer-Moore ef sA (1996) Biochemistrv 
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35:5125-5136; Schiffer et aL (1995) Biochemistry 34:16279-16287: 
Costi etal. (1996) Biochemistry 35:3944-3949: Grayes et al. (1992) 
Biochemistry 31 :1 5-21 ; Carreras ef a/. (1992) Biochemistry 31:6038- 
6044). Such predictions can be made by those of skill in the art of 
5 computational chemistry. Hence, for any selected enzyme, the mutations 
need to inactivate catalytic activity but retain binding activity can be 
determined empirically. 

Mutant enzymes can be selected for example by plating plasmids 
containing DNA containing mutagenized genes in wells coated with 

10 duplexes containing mismatches, expressing the proteins, and looking for 
binding to the mismatched duplexes, and selecting the nucleic acid that 
expressed the proteins that bound thereto. 

A typical mutant enzyme, is a DNA repair enzyme with a mutation 
that attenuates the catalytic activity, but that has little or small effects on • 

15 the binding activity. By selecting the enzymes that bind to duplexes, 
which are retained on a support, enzymes with the desired specificity and 
lack of catalytic activity will be selected. Enzymes the retain catalytic 
activity, will not remain bound. 

Exemplary DNA repair enzyme and complexes thereof that can be 

20 mutated for use in the methods herein, include, but are not limited to, a 
mutant mutH, a mutant mutL, a mutant mutM, a mutant mutS, a mutant 
mutY, a mutant uvrD, a mutant, dam, a mutant thymidine DNA 
glycosylase (TDG), a mutant mismatch-specific DNA glycosylase (MUG), 
a mutant AlkA, a mutant MLH1, a mutant MSH2, a mutant MSH3, a 

25 mutant MSH6, a mutant Exonuclease I, a mutant T4 endonuclease V, a 
mutant FEN1 (RAD27), a mutant DNA polymerase 6, a mutant DNA 
polymerase e, a mutant RPA, a mutant PCNA, a mutant RFC, a mutant 
Exonuclease V, a mutant DNA polymerase III holoenzyme, a mutant DNA 
helicase, a mutant RecJ exonuclease or a combination thereof. 
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a. Nucleic acids encoding DNA repair enzymes 
Nucleic acids encoding DNA repair enzymes can be obtained by 
metfiods Icnown in the art. Known nucleic acid sequences of DNA repair 
enzymes can be used in isolating nucleic acids encoding DNA repair 
5 enzymes from natural or other sources. Alternatively, complete or partial 
nucleic acids encoding DNA repair enzymes can be obtained by chemical 
synthesis according to the known sequences or obtained from 
commercial or other sources. 

Eul<aryotic cells and prokaryotic ceils can serve as a nucleic acid 

10 source for the isolation of nucleic acids encoding DNA repair enzymes. 
The DNA can be obtained by standard procedures known in the art from 
cloned DNA {e.g., a DNA "library"), chemical synthesis, cDNA cloning, or 
by the cloning of genomic DNA, or fragments thereof, purified from the 
desired cell (see, for example, Sambrook et at., 1989, Molecular Cloning, - 

15 A Laboratory Manual, 2d Ed., Cold Spring Harbor Laboratory Press, Cold 
Spring Harbor, New York; Glover, D.M. (ed.), 1986, DNA Cloning: A 
Practical Approach, MRL Press, Ltd., Oxford, U.K. Vol. I, II.), Clones 
derived from genomic DNA can contain regulatory and intron DNA regions 
in addition to coding regions; clones derived from cDNA or RNA contain 

20 only exon sequences. Whatever the source, the gene is generally 
molecularly cloned into a suitable vector for propagation of the gene. 

In the molecular cloning of the gene from cDNA, cDNA can be 
generated from total cellular RNA or mRNA by methods that are known in 
the art. The gene can also be obtained from genomic DNA, where DNA 

25 fragments are generated (e.g., using restriction enzymes or by mechanical 
shearing), some of which will encode the desired gene. The linear DNA 
fragments can then be separated according to size by standard 
techniques, including but not limited to, agarose and polyacrylamide gel 
electrophoresis and column chromatography. 

30 Once the DNA fragments are generated, identification of the 

specific DNA fragment containing all or a portion of the DNA repair 
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enzymes gene can be accomplished in a number of ways. 

A preferred method for isolating an DNA repair enzyme gene is by 

the polymerase chain reaction (PCR), which can be used to amplify the 

desired DNA repair enzyme sequence in a genomic or cDNA library or 
5 from genomic DNA or cDNA that has not been incorporated into a library. 

Oligonucleotide primers which hybridize to the DNA repair enzyme 

sequences can be used as primers in PCR. 

Additionally, a portion of the DNA repair enzyme (of any species) 

gene or its specific RNA, or a fragment thereof, can be purified (or an 
10 oligonucleotide synthesized) and labeled, the generated DNA fragments 

may be screened by nucleic acid hybridization to the labeled probe 

(Benton, W. and Davis, R., 1977, Science 196:180: Grunstein, M. And 

Hogness, D., 1975, Proc. Natl. Acad. ScL U.S.A. 72:3961). Those DNA 

fragments with substantial homology to the probe will hybridize. The 
1 5 DNA repair enzyme nucleic acids can be also identified and isolated by 

expression cloning using, for example, DNA repair activities or anti-DNA 

repair enzyme antibodies for selection. 

Alternatives to obtaining the DNA repair enzyme DNA by cloning or 

amplification include, but are not limited to, chemically synthesizing the 
20 gene sequence itself from the known DNA repair enzyme nucleotide 

sequence or making cDNA to the mRNA which encodes the DNA repair 

enzyme. Any suitable method known to those of skill in the art may be 

employed. 

Once a clone has been obtained, its identity can be confirmed by 
25 nucleic acid sequencing {by methods known in the art) and comparisort to 
known DNA repair enzyme sequences. DNA sequence analysis can be 
performed by techniques known in the art, including but not limited to, 
the method of Maxam and Gilbert (1980, Meth. Enzymol. 65:499-560), 
the Sanger dideoxy method (Sanger, F., et a!., 1977, Proc. Natl. Acad. 
30 ScL U.S.A. 74:5463), the use of T7 DNA polymerase (Tabor and 
Richardson, U.S. Patent No. 4,795,699), use of an automated DNA 
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sequenator (e.g., Applied Biosystems, Foster City, CA). 

Nucleic acids which are hybridizable to a DNA repair enzyme 
nucleic^acid, or to a nucleic acid encoding an DNA repair enzyme 
derivative can be isolated, by nucleic acid hybridization under conditions 
5 of low, high, or medium stringency (Shilo and VVeinberg, 1981, Proc. 
Natl. Acad. Sci. USA 78:6789-6792). 

b. Selecting and producing mutant DNA repair enzymes 
Once nucleic acids encoding the DNA repair enzymes are obtained, 
these nucleic acids can be mutagenized and screened and/or selected for 

10 DNA repair enzymes that substantially retain their binding affinity or have 
enhanced binding affinity for abnormal base-pairing but have attenuated 
catalytic activity. Insertion, deletion or point mutation(s) can be 
introduced into nucleic acids encoding the DNA repair enzymes. 
Techniques for mutagenesis known in the art can be used, including, but * 

15 not limited to, in vitro site-directed mutagenesis (Hutchinson et aL, 1978, 
J. Biol, Chem 253 :6551), use of TAB® linkers (Pharmacia), mutation- 
containing PCR primers, etc. Mutagenesis can be followed by phenotypic 
testing of the altered gene product. 

Site-directed mutagenesis protocols can take advantage of vectors 

20 that provide single stranded as well as double stranded DNA, as needed. 
Generally, the mutagenesis protocol with such vectors is as follows. A 
mutagenic primer, Le., a primer complementary to the sequence to be 
changed, but including one or a small number of altered, added, or 
deleted bases, is synthesized. The primer is extended in vitro by a DNA 

25 . polymerase and, after some additional manipulations, the now double- 
stranded DNA is transfected into bacterial cells. Next, by a variety of 
methods, the desired mutated DNA is identified, and the desired protein is 
purified from clones containing the mutated sequence. For longer 
sequences, additional cloning steps are often required because long 

30 inserts (longer than 2 kilobases) are unstable in those vectors. Protocols 
are known to one skilled in the art and kits for site-directed mutagenesis 
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are widely available from biotechnology supply companies, for example 
from Amersham Life Science, Inc. (Arlington Heights, IL) and Stratagene 
Cloning Systems (La Jolla, CA). 

Information regarding to the structural-function relationship of the 
5 DNA repair enzymes can be used in the mutagenesis and selection of 
DNA repair enzymes that substantially retain their binding affinity or have 
enhanced binding affinity for the abnormal base-pairing but have 
attenuated catalytic activity. For example, mutants can be made in the 
enzyme's binding site for its co-enzyme, cp-factor, or in the mutant 

10 enzyme's catalytic site, or a combination thereof. 

Once a mutant DNA repair enzyme with desired properties, i.e., 
substantially retaining its binding affinity or having enhanced binding 
affinity for the abnormal base-pairing but has attenuated catalytic 
activity, is identified, such mutant DNA repair enzyme can be produced 

1 5 by any methods known in the art including recombinant expression, 

chemical synthesis or a combination thereof. Preferably, the mutant DNA 
repair enzyme is obtained by recombinant expression. 

For recombinant expression, the mutant DNA repair enzyme gene 
or portion thereof is inserted into an appropriate cloning vector for 

20 expression in a particular host cell. A large number of vector-host 

systems known in the art may be used. Possible vectors include, but are 
not limited to, plasmids or modified viruses, but the vector system must 
be compatible with the host cells used. Such vectors include, but are not 
lirifiited to, bacteriophages such as lambda derivatives, or plasmids such 

25 as pBR322 or pUC plasmid derivatives or the Bluescript vector 

(Stratagene). The insertion into a cloning vector can, for example, be 
accomplished by ligating the DNA fragment into a cloning vector which 
has complementary cohesive termini. If, however, the complementary 
restriction sites used to fragment the DNA are not present in the cloning 

30 vector, the ends of the DNA molecules can be enzymatically modified. 
Alternatively, a desired site can be produced by ligating sequences of 
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nucleotides (linkers) onto the DNA termini; these ligated linkers can 
include specific oligonucleotides encoding restriction endonuclease 
recognition sequences. Recombinant molecules can be introduced into 
host cells via transformation, transfection, infection, electroporation, etc., 
5 so that many copies of the gene sequence are generated. 

In an alternative method, the desired gene can be identified and 
isolated after insertion into a suitable cloning vector in a "shot gun" 
approach. Enrichment for the desired gene, for example, by size 
fractionation, can be done before insertion into the cloning vector. 

10 In specific embodiments, transformation of host cells with 

recombinant DNA molecules that incorporate the isolated mutant DNA 
repair enzyme gene, cDNA, or synthesized DNA sequence enables 
generation of multiple copies of the gene. Thus, the gene can be 
obtained in large quantities by growing transformants, isolating the 

1 5 recombinant DNA molecules from the transformants and, when 

necessary, retrieving the inserted gene from the isolated recombinant 
DNA. 

The nucleotide sequence coding for a mutant DNA repair enzyme 
or a functionally active analog or fragment or other derivative thereof, can 

20 be inserted into an appropriate expression vector, e.g., a vector which 
contains the necessary elements for the transcription and translation of 
the inserted protein-coding sequence. The necessary transcriptional and 
translational signals, can also be supplied by the native mutant DNA repair 
enzyme gene and/or its flanking regions. A variety of host-vector 

25 systems can be utilized to express the protein*coding sequence. These 
systems include but are not limited to mammalian cell systems infected 
with virus (e.g., vaccinia virus, adenovirus, etc.); insect cell systems 
infected with virus (e.^., baculovirus); microorganisms such as yeast 
containing yeast vectors, or bacteria transformed with bacteriophage, 

30 DNA, plasmid DNA, or cosmid DNA. The expression elements of vectors 
vary in their strengths and specificities. Depending on the host-vector 
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system utilized, suitable transcription and translation elements c^n be 
used. 

The methods previously described for the insertion of DNA 
fragments into a vector can be used to construct expression vectors 
5 containing a chimeric gene containing appropriate transcrip- 

tional/translational control signals and the protein coding sequences. 
These methods can include in vitro recombinant DNA and synthetic 
techniques and in yivo recombinants (genetic recombination). Expression 
of a nucleic acid sequence encoding a mutant DNA repair enzyme or 

10 peptide fragment can be regulated by a second nucleic acid sequence so 
that the mutant DNA repair enzyme or peptide is expressed in a host 
transformed with the recombinant DNA molecule. For example, 
expression of a mutant DNA repair enzyme can be controlled by a 
promoter/enhancer element as is known in the art. Promoters which can • 

1 5 be used to control a mutant DNA repair enzyme expression include, but 
are not limited to, the SV40 early promoter region (Bernoist and 
Chambon, 1981, Nature 290 :304-31 0)> the promoter contained in the 3' 
long terminal repeat of Rous sarcoma virus (Yamamoto, et a!., 1980, Cell 
22:787-797), the herpes thymidine kinase promoter (Wagner et al., 

20 1981, Proc. Natl. Acad, Sci. U.S.A. 78:1441-1445), the regulatory 
sequences of the metallothioneine gene (Brinster et al., 1982, Nature 
296 :39-42); prokaryotic. expression vectors such as the ;?-lactamase 
promoter (Villa-Kamaroff, et al., 1978, Proc. Natl. Acad. Sci. U.S.A. 
75:3727-3731), or the tac promoter (DeBoer, et al., 1983, Proc. Natl. 

25 Acad. Sci. U.S.A. 80:21-25); see also "Useful proteins from recombinant 
bacteria" in Scientific American, 1980, 242 :74-94: promoter elements 
from yeast or other fungi such as the Gal 4 promoter, the ADC (alcohol 
dehydrogenase) promoter, PGK (phosphoglycerol kinase) promoter, 
alkaline phosphatase promoter, and certain animal transcriptional control 

30 regions. 

For example, a vector can be used that contains a promoter 
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operably linked to a nucleic acid encoding a mutant DNA repair enzyme, 
one or more origins of replication, and, optionally, one or more selectable 
markers (e.g., an antibiotic resistance gene). 

In a specific embodiment, an expression construct is made by 
5 subcloning a mutant DNA repair enzyme coding sequence into the EcoR\ 
restriction site of each of the three pGEX vectors (Glutathione S- 
Transferase expression vectors; see, e.g., Smith and Johnson, 1988, 
Gene 7:31-40). This allows for the expression of a mutant DNA repair 
enzyme product from the subclone in the correct reading frame. 

10 Expression vectors containing a mutant DNA repair enzynqe gene 

inserts can be identified by three general approaches: (a) nucleic acid 
hybridization, (b) presence or absence of "marker" gene functions, and |c) 
expression of inserted sequences. In the first approach, the presence of 
a mutant DNA repair enzyme gene inserted in an expression vector can be ■ 

1 5 detected by nucleic acid hybridization using probes containing sequences 
that are homologous to an inserted mutant DNA repair enzyme gene. In 
the second approach, the recombinant vector/host system can be 
identified and selected based upon the presence or absence of certain 
"marker" gene functions (e.g., thymidine kinase activity, resistance to 

20 antibiotics, transformation phenotype, occlusion body formation in 

baculovirus, etc.) caused by the insertion of a mutant DNA repair enzyme 
gene in the vector. For example, if the mutant DNA repair enzyme gene 
is inserted within the marker gene sequence of the vector, recombinants 
containing the mutant DNA repair enzyme insert can be identified by the 

25 absence of the marker gene function. In the third approach, recombinant 
expression vectors can be identified by assaying the mutant DNA repair 
enzyme product expressed by the recombinant. Such assays can be 
based, for example, on the physical or functional properties of the mutant 
DNA repair enzyme in in vitro assay systems, e.g., binding with 

30 anti-mutant DNA repair enzyme antibody. 

Once a particular recombinant DNA molecule is identified and 
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isolated, several methods known in the art can be used to propagate it. 
Once a suitable host system and growth conditions are established, 
recombinant expression vectors can be propagated and prepared in 
quantity. As previously explained, the expression vectors which can be 
5 used include, but are not limited to, the following vectors or their 

derivatives: human or animal viruses such as vaccinia virus or adenovirus; 
insect viruses such as baculovirus; yeast vectors; bacteriophage vectors 
{e.g., lambda), and plasmid and cosmid DNA vectors, to name but a few. 
In addition, a host cell strain can be chosen which modulates the 

10 expression of the inserted sequences, or modifies and processes the gene 
product in the specific fashion desired. Expression from certain 
promoters can be elevated in the presence of certain inducers; thus, 
expression of the genetically engineered mutant DNA repair enzyme can 
be controlled. Furthermore, different host cells have characteristic and 

1 5 specific mechanisms for the translational and post-translational 
. processing and modification {e.g., glycosylation, phosphorylation) of 
proteins. Appropriate cell lines or host systems can be chosen to ensure 
the desired modification and processing of the foreign protein expressed. 
For example, expression in a bacterial system can be used to produce an 

20 unglycosylated core protein product. Expression in yeast will produce a 
glycosylated product. Expression in appropriate animal cells can be used 
to ensure "native" glycosylation of a heterologous protein. Furthermore, 
different vector/host expression systems can effect processing reactions 
to different extent. 

25 

c. Mutant mutL or MLH1 

In a specific embodiment, a mutant mutL or MLH1 is used in the 
present methods. The nucleic acid molecules containing sequences of 
30 nucleotides with the following GenBank accession Nos. can be used in 
obtaining nucleic acid encoding mutL and in mutagenesis: AF170912 
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(Caulobacter crescentus), AI518690 {Drosophila melanogaster), 
AI456947 (Drosophila melanogaster), AI389544 (Drosophila 
melanogaster), AI387992 (Drosophila melanogaster), AI292490 
(Drosophila melanogaster), AF068271 (Drosophila melanogaster), 
5 AFQ68257 (Drosophila melanogaster), U50453 (Thermus aquaticus), 
U27343 (Bacillus subtilis), U71053 (U71053 (Thermotoga maritima), 
U71052 (Aquifex pyrophilus), U13696 (Human), U13695 (Human), 
M29687 (S.typhimurium), M63655 (E. coli) and LI 9346 (Escherichia 
coli). The nucleic acid molecules containing sequences of nucleotides 

10 with the following GenBank accession Nos. can be used in obtaining 
nucleic acid encoding MLH1 and in mutagenesis: AI389544 (Drosophila 
melanogaster), AI387992 (Drosophila melanogaster), AF068257 
(Drosophila melanogaster), U80054 (Rattus norvegicus) and U07187 
(Saccharomyces cerevisiae). 

15 In a preferred embodiment, mutant mutL or MLH1 used in the 

present methods has a mutation in its catalytic site, ATP binding site or 
combination thereof (Ban and Yang, Cell, 95:541-552 (1998)). 

In another preferred embodiment, the mutant mutL used in the 
present methods is an f. Co// mutant mutL having a E29K, E32K, A37T, 

20 D58N, G60S, G93D, R95C, G96S, G96D, S112U A16T, A16V, P305L, 
H308Y, G238D, S106F or A271V mutation (Aronshtam and Marinus, 
Nucleic Acids Res,, 24(13) :2498-504 (1996)). 

In still another preferred embodiment, the mutant MLH1 used in the 
present methods is a human mutant MLH1 having a P28L, M35R, S44F, 

25 G67R, I68N, I107R, T117R, T117M, R265H, V185G or G224D mutation 
(Peltomaki and Vasen, Gastroenterology, 1 13(4) : 11 46-58 (1997)). 
d. Mutant MutS 
In another specific erhbodiment, a mutant mutS is used in the 
present methods. The nucleic acid molecules containing sequences of 

30 nucleotides with the following GenBank accession Nos. can be used in 
obtaining nucleic acid encoding mutS and in mutagenesis: AF1 46227 
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(Mus musculus), AF193018 (Arabidopsis thaliana), AF1 44608 (Vibrio 
parahaemolyticus), AF034759 (Homo sapiens), AF1 04243 (Homo 
sapiens), AF007553 (Thermus aquaticus caldophilus), AF1 09905 (Mus 
musculus), AF070079 (Homo sapiens), AF070071 (Homo sapiens), 
5 AH006902 (Homo sapiens), AF048991 (Homo sapiens), AF048986 
(Homo sapiens), U33117 (Thermus aquaticus), U16152 (Yersinia 
enterocolitica), AF000945 (Vibrio cholarae), U698873 (Escherichia coli), 
AF003252 (Haemophilus influenzae strain b (Eagan), AF003005 
(Arabidopsis thaliana), AF002706 (Arabidopsis thaliana), L10319 

10 (Mouse), D63810 (Thermus thermophilus), U27343 (Bacillus subtilis), 
U71155 (Thermotoga maritima), U71154 (Aqulfex pyrophilus), U16303 
(Salmonella typhimurium), U21011 (Mus musculus), M84170 (S. 
cerevisiae), M84169 (S. cerevisiae), Ml 8965 (S, typhimurium) and 
M63007 (Azotobacter vinelandii). 

1 5 Preferably, the mutant mutS used in the present methods has a 

mutation in its catalytic site, dimerization site, mutL interaction site or a 
combination thereof. Also preferably, the mutant mutS used in the 
present methods is an E Co// mutant mutS (see, e.g., Wu ef a/., J. Biol. 
Chem., 274191:5948-52 (1999)). 

20 e. Mutant MutM 

In still another specific embodiment, a mutant mutM is used in the 
present methods. The nucleic acid molecules containing sequences of 
nucleotides with the following GenBank accession Nos. can be used in 
obtaining nucleic acid encoding mutM and in mutagenesis: AF148219 

25 (Nostoc PCC8009), AF026468 (Streptococcus mutans), AF093820 
(Mastigocladus laminosus), AB010690 (Arabidopsis thaliana), U40620 
(Streptococcus mutans), AB008520 (Thermus thermophilus) and 
AF026691 (Homo sapiens). 

Preferably, the mutant mutM used in the present methods has a 

30 mutation in its catalytic site, mutY interaction site or combination thereof 
(Michaels et al., Proc. Natl. Acad. ScL U.S.A., 89(151:7022-5 (1992)). 
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Also preferably, the mutant mutM used in the present methods is an 
E.CoH mutant mutM having a K57G or K57R mutation (Sidorkina and 
Laval, Nucleic Acids Res, 261231:5351-7 (1998)). 

f . Mutant MutY 

5 In yet another specific embodiment, a mutant mutY is used in the 

present methods. The nucleic acid molecules containing sequences of 
nucleotides with the following GenBank accession Nos. can be used in 
obtaining nucleic acid encoding mutY and in mutagenesis: AF121797 
(Streptomyces), U63329 (Human), AA409965 (Mus musculus) and 

10 AF056199 (Streptomyces). 

Preferably, the mutant mutY used in the present methods has a 
mutation in its catalytic site, mutM interaction site or combinatton thereof 
(Michaels et aL, Proc. NatL Acad. ScL U.S.A., 89(15) :7022-5 (1992)). 
Also preferably, the mutant mutY used in the present methods is an 

15 E Co// mutant mutY having an E37S, V45N, G1 16D, D138N or K142A 
mutation (Lu et a!., J. Biol. Chem., 271 (39): 241 38-43 (1996); Guan et 
a!., Nat. Struct. Biol., 5021:1058-64 (1998); and Wright et al., J. Biol. 
Chem,, 274(411 :2901 1-18 (1999)), More preferably, the abnormal base- 
pairing to be detected is a A:C mismatch and the mutant DNA repair 

20 enzyme used in the present methods is a mutant MutY. 

g. Mutant uvrD 

In yet another specific embodiment, a mutant uvrD is used in the 
present methods. The nucleic acid molecules containing sequences of 
nucleotides with the following GenBank accession Nos. can be used in 
25 obtaining nucleic acid encpding uvrD and in mutagenesis: L02122 (E. 
coli), AF028736 (Serratia marcescens), AF010185 (Pseudomonas 
aeruginosa), D00069 (Escherichia coli), AB001291 (Thermus 
thermophilus), M38257 (Escherichia coli) and L22432 (Mycoplasma 
capricolum). 

30 Preferably, the mutant uvrD used in the present methods has a 

mutation in its catalytic site, ATP binding site or combination thereof. 
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Also preferably, the mutant uvrD used in the present methods is an E.CoH 
mutant uvrD having a K35M, D220NE221Q, E221Q or Q251E mutation 
(Brosh and Matson, J. BacterioL, 177(19) ;5612-21 (1995); George at al., 
J. MoL BioL, 235121:424-35 (1994); and Brosh and Matson, J. Biol. 
5 Chem., 272(1 ) :572-79 (1997)). 

h. Mutant MSH2 

In yet another specific embodiment, a mutant MSH2 is used in the 
present methods. The nucleic acid molecules containing sequences of 
nucleotides with the following GenBank accession Nos. can be used in 
10 obtaining nucleic acid encoding l\/ISH2 and in mutagenesis: AF1 09243 
(Arabidopsis thaliana), AF030634 (Neurospora crassa), AF002706 
(Arabidopsis thaliana), AF026549 (Arabidopsis thaliana), L47582 (Homo 
sapiens), L47583 (Homo sapiens), L47581 (Homo sapiens) and M84170 
(S. cerevisiae). 

1 5 Preferably, the mutant MSH2 used in the present methods has a 

mutation in its catalytic site, ATP binding site, ATPase site or 
combination thereof. Also preferably, the mutant MSH2 used in the 
present methods is a S. cerevisiae mutant MSH2 having a G693P or a 
G855D mutation (Alani et al.. Mo/. Cell. Biol., 17(51 :2436-47 (1997)), or 

20 a human mutant MSH2 having a fragment encoding 195 amino acids 
within the C-terminal domain of hMSH-2 or having a K675R mutation 
(Whitehouse et al., Biochem. Biophys. Res. Commun., 232(1) : 10-3 
(1997); and laccarino et al., EMBOJ., 17(9) :2677-86 (1998)). 

i. Mutant MSH6 

25 In yet another specific embodiment, a mutant MSH6 is used in the 

present methods. The nucleic acid molecules containing sequences of 
nucleotides with the following GenBank accession Nos. can be used in 
obtaining nucleic acid encoding MSH6 and in mutagenesis: U54777 
(Homo sapiens) and AF031087 (Mus musculus). 

30 Preferably, the mutant MSH6 used in the present methods has a 

mutation in its catalytic site, ATP binding site, ATPase site or 
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combination thereof. Also preferably, the mutant MSH6 used in the 
present methods is a human mutant MSH6 having a Kl 140R mutation 
(laccarino et al., EMBO J., 17(9) :2677-86 (1 998)). More preferably, the 
mutant DNA repair complex used in the present methods comprises a 
5 human mutant MSH2 having a K675R mutation and a human mutant 
MSH6 having a Kl 140R mutation. 

j. Mutant T4 endonuclease V 
In yet another specific embodiment, a mutant T4 endonuclease V is 
used in the present methods. The nucleic acid molecules containing 
10 sequences of nucleotides with the following GenBank accession Nos. can 
be used in obtaining nucleic acid encoding T4 endonuclease V and in 
mutagenesis: M35392 (Synthetic), U76612 (Coliphage), U48703 
(Bacteriophage T4) and M23414 (Synthetic). Preferably, the mutant T4 
endonuclease V used in the present methods has a E23Q mutation (Doi et - 
15 al., Proc. Natl. Acad. ScL U.S.A., 89(20) :9420-4 (1992)). 
k. Mutant MSH3 
In yet another specific embodiment, a mutant MSH3 is used in the 
present methods. The nucleic acid molecules containing sequences of 
. nucleotides with the following GenBank accession Nos. can be used in 
20 obtaining nucleic acid encoding MSH3 and in mutagenesis: J04810 
(Human) and M96250 (Saccharomyces cerevisiae). 
I. Mutant alkA 
In yet another specific embodiment, a mutant alkA is used in the 
present methods. The nucleic acid molecules containing sequences of 
25 nucleotides with the following GenBank accession Nos. can be used in 
obtaining nucleic acid encoding alkA and in mutagenesis: D14465 
(Bacillus subtllis) and K02498 (E. coli). 

m. Mutant Exonuclease I 
In yet another specific embodiment, a mutant exonuclease I is used 
30 in the present methods. The nucleic acid molecules containing sequences 
of nucleotides with the following GenBank accession Nos. can be used in 
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obtaining nucleic acid encoding exonuclease I and in nnutagenesis: 
AF060479 (Homo sapiens), U86134 (Saccharomyces cerevisiae) and 
J02641 (E. coli). 

n. Mutant feni 

5 In yet another specific embodiment, a mutant feni is used in the 

present methods. The nucleic acid molecules containing sequences of 
nucleotides with the following GenBank accession Nos. can be used in 
obtaining nucleic acid encoding feni and in mutagenesis: AF065397 
{Xenopus laevis (FEND) and AF036327 (Xenopus laevis (FENI)). 

10 o. Mutant rpa 

In yet another specific embodiment, a mutant rpa is used in the 
present methods. The nucleic acid molecules containing sequences of 
nucleotides with the following GenBank accession Nos. can be used in 
obtaining nucleic acid encoding rpa and in mutagenesis: AA955716 

15 (Homo sapiens), AA955320 (Homo sapiens), AA925949 (Homo 

sapiens), a29383 (Zea mays), U33419 (Orf virus) and L07493 (Homo 
sapiens). 

p. Mutant pcna 

In yet another specific embodiment, a mutant pcna is used in the 
20 present methods. The nucleic acid molecules containing sequences of 
nucleotides with the following GenBank accession Nos. can be used in 
obtaining nucleic acid encoding pcna and in mutagenesis: AB025029 
(Nicotiana tabacum), AF038875 (Nicotiana tabacum), AF104412 
(Nicotiana tabacum), AA925316 (Rattus norvegicus), AA924358 (Rattus 
25 norvegicus), AA923907 (Rattus norvegicus), AA901212 (Rattus 

norvegicus), AA858643 (Rattus norvegicus), AA441 366 (Drosophila 
melanogaster), AA440162 (Drosophila melanogaster), L42763 (Styela 
clava), AF085197 (Nicotiana tabacum), AF020427 (Sarcophaga 
crassipalpis), AB002264 (Bombyx mori), J04718 (Human), M34080 
30 (X.laevis) and M33950 (D. melanogaster). 

q. Mutant Replication factor C 
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In yet another specific embodiment, a mutant replication factor C is 
used in the present methods. The nucleic acid molecules containing 
sequences of nucleotides with the following GenBank accession Nos. can 
be used in obtaining nucleic acid encoding replication factor C and in 
5 mutagenesis: AF1 39987 (Mus musculus), AA924760 (Homo sapiens), 
AA901331 (Homo sapiens), AA900852 {Homo sapiens), AA899302 
(Homo sapiens), AA819500 (Rattus norvegicus), U60144 (Anas 
platyrhynchos), U26031 (Saccharomyces cerevisiae), U26030 
(Saccharomyces cerevisiae), U26029 (Saccharomyces cerevisiae), 

10 U26028 (Saccharomyces cerevisiae), U26027 (Saccharomyces 

cerevisiae), AF045555 (Homo sapiens), U86620 (Emericella nidulans), 
U86619 (Emericella nidulans), D28499 (Yeast), U07685 (Drosophila 
melanogaster), M87338 (Human), M87339 (Human), L07540 (Human), 
L07541 (Human), L20502 (Saccharomyces cerevisiae), LI 8755 

15 (Saccharomyces cerevisiae), U12438 (Gallus gallus Leghorn) and L23320 
(Human). 

r. Mutant Uracil DNA glycosylase 

In yet another specific embodiment, a mutant uracil DNA 
glycosylase (UDG) is used in the present methods. The nucleic acid 

20 molecules containing sequences of nucleotides with the following 

GenBank accession Nos. can be used in obtaining nucleic acid encoding 
uracil DNA glycosylase and in mutagenesis: AFl 74292 
(Schizosaccharomyces pombe), AF1 08378 (Cercopithecine herpesvirus), 
AF125182 (Homo sapiens), AFl 251 81 (Xenopus laevis), U55041 (Homo 

25 sapiens), U55041 (Mus musculus), AF084182 (Guinea pig 

cytomegalovirus), U31 857 (Bovine herpesvirus), AF022391 (Feline 
herpesvirus), M87499 (Human), J04434 (Bacteriophage PBS2), U13194 
(Human herpesvirus 6), L34064 (Gallid herpesvirus 1), U04994 (Gallid 
herpesvirus 2), L01417 (Rabbit fibroma virus), M25410 (Herpes simplex 

30 virus type 2), J04470 (S.cerevisiae), J03725 (E.coli), U02513 (Suid 
herpesvirus), U02512 (Suid herpesvirus) and LI 3855 (Pseudorabies 
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virus). 

s. Mutant Thymidine DIM A glycosyiase 
In yet another specific embodiment, a mutant thymidine DNA 
glycosyiase (TDG) is used in the present methods. The nucleic acid 
5 molecules containing sequences of nucleotides with the following 

GenBank accession Nos. can be used in obtaining nucleic acid encoding 
thymidine DNA glycosyiase and in mutagenesis: AF1 17602 (Ateles 
paniscus chamek). Preferably, the abnormal base-pairing to be detected 
is a G:T mismatch and the mutant DNA repair enzyme used in the present 
10 methods is a mutant TDG (Hsu et al., Carcinogenesis, 15(8) :1 657-62 
(1994)). 

t. l\/lutant dam 
In yet another specific embodiment, a mutant dam is used ip the 
present methods. The nucleic acid molecules containing sequences of 

1 5 nucleotides with the following GenBank accession Nos. can be used in 
obtaining nucleic acid encoding dam and in mutagenesis: AF091142 
(Neisseria meningitidus strain BF13), AF006263 (Treponema pallidum), 
U76993 (Salmonella typhimurium) and l\/l22342 (Bacteriphage T2). 
2. Detecting the binding of the mutant enzyme 

20 . Binding of the mutant enzyme to a duplex can be detected by any 

method known to those of skill in the art for detection of proteins. The 
enzyme may be specifically labeled, such as with a fluorescent label, 
radiolabeled, tagged with a readily tag that can be readily purified, labeled 
with another enzyme, or antibody. In an exemplary embodiment, bibtin 

25 is bound to the mutant enzyme, which can then interact with a 

streptavidin-labeled moiety, such a horse radish peroxidase (HRPO), 
which upon reaction with an appropriate substrate will form a colored 
product. 

For example, an array of nucleic acid probes, containing for 
30 example, from about 20 to about 50 up to about 100 nucleotides, are 
hybridized with single-stranded nucleic acid from a sample. The hybids 
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are contacted with a selected or a plurality of mutant enzymes, which are 

labeled with biotin. After contacting the biotin reacts with streptavidin 

which is labeled, such as with HRPO, and the bound mutant enzyme is 

detected by virtue of the formation of detectable product, such as colored 

5 product. If the probes on the array are of known sequence, selected, for 

example for inclusion of polymorphisms, then upon reaction, the presence 

or absence of an array of polymorphism in the sample can be rapidly and 

. readily identified. 

C. METHODS FOR DETECTING MUTATIONS IN NUCLEIC ACIDS FOR 
10 PROGNOSIS AND DIAGNOSIS OF DISEASES, DISORDERS AND 

INFECTIONS 

Also provided herein are methods for detecting a mutations in a 
nucleic acid molecule for diagnostic and prognostic applications. These 
methods involve binding a mutant nucleic acid binding enzyme, such as a 

1 5 mutant repair enzyme to nucleic acids in sample, such as body tissue or 
fluid sample, and detecting the bound mutant enzyme. These reactions, 
can be performed in solution, or, preferably in solid phase. 

In one embodiment, single-stranded nucleic acids, either those 
known to be wild type or with a mutation indicative of a particular 

20 disorder are hybridized with the sample nucleic acid. The resulting 
duplexes are contacted with a selected mutant enzyme or a plurality 
thereof that contain different specificities. The resulting complexes, 
which are indicative a difference in sequence between the strands in the 
sample from the known strands, are detected. These methods can be 

25 performed in solution or preferably in solid phase. In a preferred 
embodiment, the single-stranded nucleic acids containing known 
sequences are on the solid support. In others, the enzymes of known 
specificities can be bound on a solid support. Bound hybrids are 
indicative of the mutation present. 

30 In a preferred embodiment, the method is performed by hybridizing 

a strand of a nucleic acid having or suspected of having a mutation with 
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a complementary strand of a wild-type nucleic acid (or with a strand 
having a known mutation), whereby the mutation results in an abnormal 
base-pairing in the formed nucleic acid duplex; contacting the nucleic acid 
duplex with a mutant DNA repair enzyme or complex thereof, where the 
5 mutant DNA repair enzyme or complex thereof has binding affinity for the 
abnormal base-pairing in the duplex but has attenuated catalytic activity; 
and detecting binding between the nucleic acid duplex and the mutant 
DNA repair enzyme or complex thereof, whereby the presence or quantity 
of the mutation is assessed. 

10 Any mutant DNA repair enzymes or complexes thereof that have 

binding affinity for the abnormal base-pairing in the duplex but have 
attenuated catalytic activity can be used in the mutation detection. 
Preferably, the mutant DNA repair enzymes or complexes thereof 
described in the above Section B can be used. Typically, the nucleic 

1 5 acid strand to be tested and the complementary wild-type nucleic acid 
strand are DNA strands. 

Mutations that can be detected by these methods, include those 
that are associated with or that are indicative of a disease or disorder or 
predilection thereto, or infection by a pathological agent. These methods 

20 can be used for prognosis or diagnosis of the presence or severity of the 
disease, disorder or infection. 

Any diseases, disorders or infections that are associated with a 
nucleic acid mutation or for which such mutation serves as a marker or 
indicator can be diagnosed or the tendency therefor prognosticated using 

25 the present methods. Such diseases and disorders include, but are not 
limited to, cancers, immune system diseases or disorders, metabolism 
diseases or disorders, muscle and bone diseases or disorders, nervous 
system diseases or disorders, signal diseases or disorders and transporter 
diseases or disorders. Infections include, but are not limited to, infections 

30 caused by viruses, eubacteria, archaebacteria and eukaryotic pathogens. 
Among the diseases or disorders that can be diagnosed or the 
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tendency to develop them, include but are not linnited to, a disease or 
disorder associated with an androgen receptor nnutation, tetrahydrobio- 
pterin deficiencies, X-Linked agammaglobulinennia, a disease or disorder 
associated with a factor VII mutation, anemia, a disease or disorder 
5 associated with a glucose-6-phosphate mutation, the glycogen storage . 
disease type II (Pompe Disease), hemophilia A, a disease or disorder 
associated with a hexosaniinidase A mutation, a disease or disorder 
associated with a human type I or type III collagen mutation, a disease or 
disorder associated with a rhodopsin or RDS mutation, a disease or 

10 disorder associated with a L1CAM mutation, a disease or disorder 

associated with a LDL receptor mutation, a disease or disorder associated 
with an ornithine transcarbamylase mutation, a disease or disorder 
associated with a PAX6 mutation and a disease or disorder associated 
with a von Willebrand factor mutation. 

15 1. Cancer 

Any cancers that are associated with a mutation (s) in a nucleic 
acid can be predicted or diagnosed using the present methods. For 
example, breast cancer, Burkitt lymphoma, colon cancer, small cell lung 
carcinoma, melanoma, multiple endocrine neoplasia (MEN), 

20 neurofibromatosis, p53-associated tumor, pancreatic carcinoma, prostate 
cancer, Ras-associated tumor, retinoblastoma and Vgn-Hippel Lindau 
disease (VHL) can be predicted or diagnosed using the present methods, 
a. Breadt cancer 
Two breast cancer susceptibility genes have been identified: 

25 BRCA1 on chromosome 17 and BRCA2 on chromosome 13. When an 
individual carries a mutation in either BRCA1 or BRCA2, they are at an 
increased risk of being diagnosed with breast or ovarian cancer at some 
point in their lives {Albertsen et al., Am. J, Hum. Genet, 54(3) :516-25 
(1 994); and Wooster et al.. Nature, 378(6559): 789-92 (1 995)). Until 

30 recently, it was not clear what the function of these genes was, until 
studies on a related protein in yeast revealed their normal role: they 
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participate in repairing radiation-induced breaks in double-stranded DNA. 
It is thouglit that mutations in BRCA1 or BRCA2 migfit disable this 
mechanism, leading to more errors in DNA replication and ultimately to 
cancerous growth. 
5 In a specific embodiment, the breast cancer to be predicted or 

diagnosed according to the present method is associated with a mutation 
in BRCA1 or BRCA2. 

b. Burkitt lymphoma 

Burkitt lymphoma results from chromosome translocations that 
10 involve the Myc gene. A chromosome translocation means that a 

chromosome is broken, which allows it to associate with parts of other 

chromosomes (Adams et al., Proc. Natl. Acad. Sci. U.S.A., 80(7) : 1982-6 

(1983); Watt et al.. Nature, 303(591 9) :725-8 (1983); and Cole, Annu. 

Rev. Genet., 20:361-84 (1986)). The classic chromosome translocation 
15 in Burkitt lymphoma involves chromosome 8, the site of the Myc gene. 

This changes the pattern of Myc's expression, thereby disrupting its usual 

function in controlling cell growth and proliferation. 

In a specific embodiment, the Burkitt lymphoma to be predicted or 

diagnosed according to the present method is associated with a mutation 
20 in Myc. 

c. Colon cancer 

Colon cancer is one of the most common inherited cancer 
syndromes known. Two key genes involved in colon cancer. have been 
found: MSH2, on chromosome 2 and MLH1, on chromosome 3. 

25 Normally, the protein products of these genes help to repair mistakes 
made in DNA replication. If the MSH2 and MLH1 proteins are mutated 
and therefore don't work properly, the replication mistakes are not 
repaired, leading to damaged DNA and, in this case, colon cancer 
(Bronner et al., Nature, 368(6468): 258-61 (1994); and Fishel et al., Cell, 

30 75151:1027-38 (1993)). 

In a specific embodiment, the colon cancer to be predicted or 
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diagnosed according to the present method is associated with a mutation 
inMSH2orMLH1. 

d. Small cell lung carcinoma 

Small cell lung carcinoma is distinctive from other kinds of lung 
5 cancer (metastases are already present at the time of discovery) and 
accounts for approximately 110,000 cancer diagnoses annually. A 
deletion of part of chromosome 3, SCLC1, was first observed in 1982 in 
small cell lung carcinoma cell lines (Whang-Peng et al., Science, 
215145291:181-2 (1982)). 
10 In a specific embodiment, the small cell lung carcinoma to be 

predicted or diagnosed according to the present method is associated 
with a mutation in SCLG1. 

e. Melanoma carcinoma 

In some cases, the risk of developing melanoma runs in families, 
1 5 where a mutation in the CDKN2 gene on chromosome 9 can underlie 
susceptibility to melanoma (Hussussian et al., Nat. Genet., 8(1) :15-21 
(1994)). CDKN2 codes for a protein called pi 6 that is an important 
regulator of the cell division cycle: It stops the cell from synthesizing DNA 
before it divides. If pi 6 is not working properly, the skin cell does not 
20 have this brake on the cell division cycle, and so can go on to proliferate 
unchecked. At some point this proliferation can be seen as a sudden 
change in skin growth or the appearance of a mole. 

In a specific embodiment, the melanoma carcinoma to be predicted 
or diagnosed according to the present method is associated with a 
25 mutation in CDKN2. 

f . Multiple endocrine neoplasia 

Multiple endocrine neoplasia (MEN) is a group of rare diseases 
caused by genetic defects that lead to hyperplasia (abnormal 
multiplication or increase In the number of normal cells in normal 
30 arrangement in a tissue) and hyperfunction (excessive functioning) of 2 or 
more components of the endocrine system. Normally, the hormones 
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released by endocrine glands are carefully balanced to met the body's 
needs. When a person has MEN, specific endocrine glands, such as the 
parathyroid glands, the pancreas gland and the pituitary gland, tend to 
become overactive. When these glands go into overdrive, the result can 
5 be: excessive calcium in the bloodstream (resulting in kidney stones or 
kidney damage); fatigue; weakness; muscle or bone pain; constipation; 
indigestion; and thinning of bones. The MENl gene, which has been 
known for several years to be found on chromosome 1 1 , was more finely 
mapped in 1997 (Chandrasekharappa et al., Science, 276(531 1 ) :404- 7 
10 (1997)). In a specific embodiment, the MEN to be diagnosed or 

predicted according to the present method is associated with a mutation 
in MENl. 

g. Neurofibromatosis 

Neurofibromatosis, type 2 (NF-2), is a rare inherited disorder 
1 5 characterized by the development of benign tumors on auditory nerves 
(acoustic neuromas). The disease is also characterized by the 
development of malignant central nervous system tumors as well. The 
NF2 gene has been mapped to chromosome 22 and is thought to be a 
'tumor-suppressor gene' (Rouleau et al., Nature, 363(64291:515-21 
20 (1993)). A mutation in NF2 impairs its function, and accounts for the . 
clinical symptoms observed in neurofibromatosis sufferers. NF-2 is an 
autosomal dominant genetic trait; it affects both genders equally and 
each child of an affected parent has a 50% chance of inheriting the gene. 
In a specific embodiment, the neurofibromatosis to be predicted or 
25 diagnosed according to the present method is associated with a mutation 
in NF2. 

h. Cancer associated with p53 mutation 

The p53 gene is a tumor suppressor gene (Harlowet al., Mol. Cell. 
Biol., 5171:1601-10 (1985)). If a person inherits only one functional copy 
30 of the p53 gene from their parents, they are predisposed to cancer and 
usually develop several independent tumors in a variety of tissues in early 
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adulthood. This condition is rare, and is known as Li-Fraumeni syndrome. 
Mutations in p53 are found in most tumor types, and so contribute to the 
complex network of molecular events leading to tumor formation. The 
p53 gene has been mapped to chromosome 17. In the cell, p53 protein 
5 binds DNA, which in turn stimulates another gene to produce a protein 
called p21 that interacts with a cell division-stimulating protein (cdk2). 
When p21 is complexed.with cdk2 the cell cannot pass through to the 
next stage of cell division. Mutant p53 can no longer bind DNA in an 
effective way, and as a consequence the p21 protein is not made 

10 available to act as the 'stop signal' for cell division. Thus cells divide 
uncontrollably, and form tumors. 

In a specific embodiment, the cancer to be predicted or diagnosed 
according to the present method is associated with a mutation in p53. 
i. Pancreatic carcinoma 

15 ' About 90% of human pancreatic carcinomas show a loss, of part of 

chromosome 18. In 1996, a possible tumor suppressor gene, DPC4 
{Smad4), was discovered from the section that is lost in pancreatic 
cancer, so may play a role in pancreatic cancer (Hahn et al., Science, 
271(5247) :350-3 (1996)). There is a whole family of Smad proteins in 

20 vertebrates, all involved in signal transduction of transforming growth 
factor-beta (TGF-beta) related pathways. 

In a specific embodiment, the pancreatic carcinoma to be predicted 
or diagnosed according to the present method is associated with a 
mutation in DPC4 (Smad4). 

25 j. Prostate cancer 

One of the most promising recent breakthroughs in prostate cancer 
research is the discovery of a susceptibility locus for prostate cancer on 
chromosome 1, called HPC1, which may account for about 1 in 500 
cases of prostate cancer (Smith et al.. Science, 274(5291) :1371-4 

30 (1996)). 

In a specific embodiment, the prostate cancer to be predicted or 
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diagnosed according to the present method is associated with a mutation 
InHPCI. 

k. Cancer associated with Ras oncogene 

Ras is an oncogene product that is found on chromosome 11. It is 
5 found in normal cells, where It helps to relay signals by acting as a switch 
(Lowy and Willumsen, Annu. Rev. Biochem., 62:851-91 (1993); Russell 
et aL, Genomics, 35(2) :353-6Q (1996); and Tong et al,. Nature, 
337(62021:90-3 (1989)). When receptors on the cell surface are 
stirnulated (by a hormone, for example), Ras is switched on and 

10 transduces signals that tell the cell to grow. If the cell-surface receptor is 
not stimulated, Ras is not activated and so the pathway that results in 
cell growth is not initiated. In about 30% of human cancers, Ras is 
mutated so that it is permanently switched on, telling the cell to grow 
regardless of whether receptors on the cell surface are activated or not. 

15 In a specific embodiment, the cancer to be predicted or diagnosed 

according to the present method is associated with a mutation in Ras 
oncogene. 

I. Retinoblastoma 

Retinoblastoma occurs in early childhood and develops from the 
20 immature retina - the part of the eye responsible for detecting light and 
color. There are hereditary and non-hereditary forms of retinoblastoma. 
In the hereditary form, multiple tumors are found in both eyes, while in 
the non-hereditary form only one eye is effected and by only one tumor. 
In the hereditary form, a gene called Rb is lost from chromosome 13 
25 (Friend et al.. Nature, 323(6089): 643-6 (1986); and Lee et al.. Science, 
235147941:1394-9 (1987)). Rb is found in all cells of the body, where 
under normal conditions it acts as a brake on the cell divisiori cycle by 
preventing certain regulatory proteins from triggering DNA replication. If 
Rb is missing, a cell can replicate itself over and over in an uncontrolled 
30 manner, resulting in tumdr formation. 

In a specific embodiment, the retinoblastoma to be predicted or 
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diagnosed according to the present method is associated with a mutation 
in Rb gene. 

. m. Von-Hippel Lindau syndrome 

Von-Hippel Lindau syndrome is an inherited multi-system disorder 
5 characterized by abnormal growth of blood vessels. While blood vessels 
normally grow like trees, in people with VHL little knots of blood 
capillaries sometimes occur. These knots are called angiomas or 
hemangioblastomas. Growths may develop in the retina, certain areas of 
the brain, the spinal cord, the adrenal glands and other parts of the body. 

10 The gene for Von-Hippel Lindau disease (VHL) is found on chromosome 
3, and is inherited in a dominant fashion (Latif et al., Science, 
260151121:1 31 7-20 (1 993)). If one parent has a dominant gene, each 
child has a 50-50 chance of inheriting that gene. The VHL gene is a 
tumor suppressor gene. 

15 In a specific embodiment, the Von-Hippel Lindau syndrome to be 

predicted or diagnosed according to the present method is associated 
with a mutation in VHL gene. 

2. Immune system disease or disorder 

Any immune system diseases or disorders that are associated with 
20 a mutation(s) in a nucleic acid can be predicted or diagnosed using the 
present methods. For example, autoimmune polyglandular syndrome type 
I (APS1, also called APECED), inflammatory bowel disease (IBD), 
DiGeorge syndrome, familial Mediterranean fever (FMF) and severe 
combined immunodeficidncy (SCID) can be predicted or diagnosed using 
25 the present methods. 

a. Autoimmune polyglandular syndrome type I 
Autoimmune polyglandular syndrome type I (APS1, also called 
APECED) is a rare autosomal recessive disorder that maps to human 
chromosome 21 . At the end of 1997, researchers reported that they 
30 isolated a novel gene, which they called AIRE (autoimmune regulator). 
Database searches revealed that the protein product of this gene is a 
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transcription factor - a protein that plays a role in the regulation of gene 
expression. The researchers showed that mutations in this gene are 
responsible for the pathogenesis of APS1 (Nagamine et al., Na.t Genet., 
17141:393-8 {1997)). 
5 In a specific embodiment, the autoimmune polyglandular syndrome 

type I to be predicted or diagnosed according to the present method is 
associated with a mutation in AIRE gene. 

b. Inflammatory bowel disease 
Inflammatory bowel disease (IBD) is a group of chronic disorders 

10 that cause inflammation or ulceration in the small and large intestines. 
Most often, IBD is classified either as ulcerative colitis or Crohn disease. 
While ulcerative colitis affects the inner lining of the colon and rectum, 
Crohn disease extends into the deeper layers of the intestinal wall. It is a 
chronic condition and may recur at various times over a lifetime. About 

1 5 20% of cases of Crohn disease appear to run in families. It is a 'complex 
trait', which means that several genes at different locations in the 
genome may contribute to the disease. A susceptibility locus for the 
disease was recently mapped to chromosome 1 6. Candidate genes found 
in this region include several involved in the inflammatory response, 

20 including: CD19, involved in B-lymphocyte function; sialophorin, involved 
in leukocyte adhesion; the CD1 1 integrin cluster, involved in 
microbacteria cell adhesion; and the interleukin-4 receptor, which is 
interesting, as IL-4-mediated functions are altered in IBDs (Hugot et al., 
Nature, 379(6568) :821-3 (1996)). 

25 In a specific embodiment, the inflammatory bowel disease to be 

predicted or diagnosed according to the present method is associated 
with a mutation in CD19, sialophorin, CD1 1 integrin cluster or interleukin- 
4 receptor. 

0. DiGeorge syndrome 

30 DiGeorge syndrome is a rare congenital (Le., present at birth) 

disease whose symptoms vary greatly between Individuals, but 
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commonly include a history of recurrent infection, heart defects and 
characteristic facial features, DiGeorge syndrome is caused by a large 
deletion from chromosome 22, produced by an error in recombination at 
meiosis (the process that creates germ cells and ensures genetic variation 
5 in the offspring). This deletion means that several genes from this region 
are not present in DiGeorge syndrome patients. It appears that the 
variation in the symptoms of the disease is related to the amount of 
genetic material lost in the chromosomal deletion (Budarf et al., Nat 
Genet, 10(3) :269-78 (1995)). 

10 d. Familial Mediterranean fever 

Familial Mediterranean fever (FMF) is an inherited disorder usually 
characterized by recurrent episodes of fever and peritonitis (inflammation 
of the abdominal membrane). In 1 997, researchers identified the gene for 
FMF and found several different gene mutations that cause this inherited 

15 rheumatic disease. The gene, found on chromosome 16, codes for a 

protein that is found almost exclusively in granulocytes - white blood cells 
important in the Immune response. The protein is likely to normally assist 
in keeping inflammation under control by deactivating the immune 
response - without this 'brake', an inappropriate full-blown Inflammatory 

20 reaction occurs: an attack of FMF (Cell, 90(4) :797-807 (1997); and.A/af. 
Gene/.,17m: 25-31 (1997)). 

In a specific embodiment, the familial Mediterranean fever to be 
predicted or diagnosed according to the present method is associated 
with a mutation in FMF gene. 

25 e. Severe combined immunodeficiency 

Severe combined immunodeficiency (SCID) represents a group of 
rare, sometimes fatal, congenital disorders characterized by little or no 
immune response (Valeric et al., EMBOJ., 4(2) :437"43 (1985); and 
Noguchi et al., Ce//, 7301:147-57 (1993)). The defining feature of SCID, 

30 commonly known as "bubble boy" disease, is a defect in the specialized 
white blood cells (B- and T-lymphocytes) that defend us from infection by 
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viruses, bacteria and fungi. Without a functional immune system, SCID 
patients are susceptible to recurrent infections such as pneumonia, 
meningitis and chicken pox, and can die before the first year of life. All 
forms of SCID are inherited, with as many as half of SCID cases linked to 
5 the X chromosome, passed on by the mother. X-linked SCID results from 
a mutation in the interleukin 2 receptor gamma (IL2RG) gene which 
produces the common gamma chain subunit, a component of several IL 
receptors. Defective IL receptors prevent the proper development of T- 
lymphocytes that play a key role in identifying invading agents as well as 

10 activating and regulating other cells of the immune system. In another 
form of SCID, there is a lack of the enzyme adenosine deaminase (ADA), 
coded for by a gene on chromosome 20. This means that the substrates 
for this enzyme accumulate in cells. Immature lymphoid cells of the 
immune system are particularly sensitive to the toxic effects of these 

1 5 unused substrates, so fail to reach maturity. As a result, the immune 
system of the afflicted individual is severely compromised or completely 
lacking. 

In a specific embodiment, the severe combined immunodeficiency 
to be predicted or diagnosed according to the present method is 
20 associated with a mutation in interleukin 2 receptor gamma (IL2RG) or 
adenosine deaminase (ADA). 

3. Metabolism system diseases and disorders 
Any metabolism diseases or disorders that are associated with a 
mutation(s) in a nucleic acid can be predicated or diagnosed using the . 
25 present methods. For example, adrenoleukodystrophy (ALD), 

atherosclerosis, Gaucher disease, gyrate atrophy of the choroid, diabetes, 
obesity, paroxysmal nocturnal hemoglobinuria (PNH), phenylketonuria 
(PKU), Refsum disease and Tangier disease (TD) can be predicted or 
diagnosed using the present methods. 
30 a. Adrenoleukodystrophy 

Adrenoleukodystrophy (ALD) is a rare, inherited metabolic disorder. 
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In this disease the fatty covering (myelin sheath) on nerve fibers in the 
brain is lost, and the adrenal gland degenerates, leading to progressive 
neurological disability and death. People with ALD accumulate high levels 
of saturated, very long chain fatty acids in their brain and adrenal cortex 
5 because the fatty acids are not broken down by an enzyme in the normal 
manner. So, when the ALD gene was discovered in 1993, it was a 
surprise that the corresponding protein was in fact a member of a family 
of transporter proteins, not an enzyme (Mosser et al.. Nature, 
361(64141 :726-30 (1993)). 

10 In a specific embodiment, the adrenoleukodystrophy to be 

predicted or diagnosed according to the present method is associated 
with a mutation in ALD gene, 

b. Atherosclerosis 
Atherosclerosis is characterized by a narrowing of the arteries 

1 5 caused by cholesterol-rich plaques of immune-system cells. Key risk 
factors for atherosclerosis, which can be genetic and/or environmental, 
include: elevated levels of cholesterol and triglyceride in the blood, high 
blood pressure and cigarette smoke. A protein called apolipoprotein E, 
which can exist in several different forms, is coded for by a gene found 

20 on chromosome 1 9. It is important for removing excess cholesterol from 
the blood, and does so by carrying cholesterol to receptors on the surface 
of liver cells. Defects in apolipoprotein E sometimes result in its inability 
to bind to the receptors, which leads to an increase a person's blood 
cholesterol, and consequently their risk of atherosclerosis (Das et al., J. 

25 BioL Chem., 260(1 0) :6240-7 (1985); and Breslow, Science, 
272152621:685-8 (1996)). 

In a specific embodiment, the atherosclerosis to be predicted or 
diagnosed according to the present method is associated with a mutation 
in apolipoprotein E. . 

30 c. Gaucher disease 

Gaucher disease is an inherited illness caused by a gene mutation 

-78- 



wo 01/62968 



PCTAJSOl/00452 



(Barneveld et al., Hum. Genet, 64(3) :227-31 (1983); and Beutler, 
Science, 256(5058) :794-9 (1992)). Normally, this gene is responsible 
for an enzynne called glucocerebrosidase that the body needs to break 
down a particular kind of fat called glucocerebroside. In people with 
5 Gaucher disease, the body is not able to properly produce this enzyme 
and the fat cannot be broken down, jt then accumulates, mostly in the 
liver, spleen and bone marrow. Gaucher disease can result in pain, 
fatigue, jaundice, bone damage, anemia and even death. 

In a specific embodiment, the Gaucher disease to be predicted or 
10 diagnosed according to the present method is associated with a mutation 
in glucocerebrosidase. 

d. Gyrate atrophy of the choroid 

People suffering from gyrate atrophy of the choroid (the thin 
coating of the eye) and retina face a progressive loss of vision, with total 

15 blindness usually occurring between the ages of 40 and 60. The disease 
is an inborn error of metabolism. The gene whose mutation causes 
gyrate atrophy is found on chromosome 10, and encodes an enzyme 
called ornithine ketoacid aminotransferase (OAT) (Akaki et al., J, Biol. 
Chem., 267(18) :12950-4 (1992); and O'Donnell et al., Am. J. Hum. 

20 Genet, 43(6) :922-8 (1988)). Different inherited mutations in OAT cause 
differences in the severity of^symptoms of the disease. OAT converts 
the amino acid ornithine from the urea cycle ultimately into glutamate. In 
gyrate atrophy, where OAT function is affected, there is an increase in 
plasma levels of ornithine. 

25 In a specific embodiment, the gyrate atrophy of the choroid to be 

predicted or diagnosed according to the present method is associated 
with a mutation in ornithine ketoacid aminotransferase (OAT). 

e. Diabetes 

Diabetes is a chronic metabolic disorder that adversely affects the 
30 body's ability to manufacture and use insulin, a hormone necessary for 
the conversion of food into energy. The disease greatly increases the risk 
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of blindness, heart disease, kidney failure, neurological disease and other 
conditions for the approximately 16 nDillion Americans who are affected 
by It. Type I, or juvenile onset diabetes, is the more severe form of the 
illness. Type I diabetes is what is known as a 'complex trait', which 
5 means that mutations in several genes likely contribute to the disease 
(Nuffield et al., Nature, 371 (6493) :1 30-6 (1994)). For example, it is now 
known that the insulin-dependent diabetes mellitus (IDDMI) locus on 
chromosome 6 may harbor at least one susceptibility gene for Type I 
diabetes. In Type I diabetes, the body's immune system mounts an 

10 immunological assault on its own insulin and the pancreatic cells that 
manufacture it. About 10 loci in the human genome have now been 
found that seem to confer susceptibility to Type I diabetes. Among these 
are (1) a gene at the locus IDDM2 on chromosome 1 1 and (2) the gene 
for glucokinase (GCK), an enzyme that is key to glucose metabolism 

15 which helps modulate insulin secretion, on chromosome 7. 

In a specific embodiment, the diabetes of the choroid to be 
predicted or diagnosed according to the present method is associated 
with a mutation in insulin-dependent diabetes mellitus (IDDMI) locus, a 
gene at the locus IDDM2, or glucokinase (GCK). 

20 

f. Obesity 

Obesity is an excess of body fat that frequently results in a 
significant impairment of health. Evidence suggests that obesity has 

25 more tha,n one cause: gerietic, environmental, psychological and other 
factors may all play a part. The hormone leptin, produced by adipocytes 
(fat cells), was discovered about three years ago in mice (Zhang et al.. 
Nature, 372(6505) :425-32 (1994)). Subsequently the human Ob gene 
was mapped to chromosome 7. Leptin is thought to act as a lipostat: as 

30 the amount of fat stored in adipocytes rises, leptin is released into the 
blood and signals to the brain that the body has enough to eat. Most 
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overweight people have high levels of leptin in their bloodstream, 
indicating that other molecules also effect feelings of salty and contribute 
to the regulation of body weight. 

In a specific embodiment; the obesity to be predicted or diagnosed 
5 according to the present method is associated with a mutation in leptin or 
human Ob gene. 

g. Paroxysmal nocturnal hemoglobinuria 
The paroxysmal nocturnal hemoglobinuria (PNH) is characterized by 
a decreased number of red blood cells (anemia), and the presence of 

10 blood in the urine (hemoglobinuria) and plasma (hemoglobinemia), which 
is evident after sleeping. PNH is associated with a high risk of major 
thrombotic events, most commonly thrombosis of large intra-abdominal 
veins. Most patients who die of their disease die of thrombosis. PNH 
blood cells are deficient in an enzyme known as PIG-A, which is required - 

1 5 for the biosynthesis of cellular anchors (Bessler et al., EMBO J., 

1301:110-7 (1994); and Miyata et al., Science, 259(50991 :1318-20 
(1 993)). Proteins that are partly on the outside of cells are often 
attached to the cell membrane by a glycosylphosphatidylinositol (GPI) 
anchor, and PIG-A is required for the synthesis of a key anchor 

20 component. If PIG-A is defective, surface proteins that protect the cell 
from destructive components in the blood (complement) are not anchored 
and therefore absent, so the blood cells are broken down. The PIG-A 
gene is found on the X chromosome. Although not an inherited disease, 
PNH is a genetic disorder, known as an acquired genetic disorder. The 

25 affected blood ceil clone passes the altered PIG-A to all its descendants- 
red cells, leukocytes (including lymphocytes), and platelets. The 
proportion of abnormal red blood cells in the blood determines the 
severity of the disease. 

In a specific embodiment, the paroxysmal nocturnal hemoglobinuria 

30 to be predicted or diagnosed according to the present method is 
associated with a mutation in PIG-A. 
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h. Phenylketonuria 

Phenylketonuria (PKU) is an inherited error of metabolism caused 
by a deficiency in the enzyme phenylalanine hydroxylase (DiLella et al.. 
Nature, 327(61 20) :333-6 (1987); and Kwok et aL, Biochemistry, 
5 24(31:556-61 (1985)). Loss of this enzyme results in mental retardation, 
organ damage, unusual posture and can, in cases of maternal PKU, 
severely compromise pregnancy. Classical PKU is an autosomal recessive 
disorder, caused by mutations in both alleles of the gene for 
phenylalanine hydroxylase (PAH), found on chromosome 12. In the body, 

10 phenylalanine hydroxylase converts the amino acid phenylalanine to 
tyrosine, another amino acid. Mutations in both copies of the gene for 
PAH means that the enzyme is inactive or Is less efficient, and the 
concentration of phenylalanine in the body can build up to toxic levels. In 
some cases, mutations in PAH will result in a phenotypically mild form of 

15 PKU called hyperphenylalanemla. Both diseases are the result of a variety 
of mutations in the PAH locus; in those cases where a patient is 
heterozygous for two mutations of PAH (ie each copy of the gene has a 
different mutation), the milder mutation will predominate. 

In a specific embodiment, the phenylketonuria to be predicted or 

20 diagnosed according to the present method is associated with a mutation 
in phenylalanine hydroxylase. 

i. Refsum disease 

Refsum disease is a rare disorder of lip.id metabolism that is 
inherited as a recessive trait. Symptoms may include a degenerative 

25 nerve disease (peripheral neuropathy), failure of muscle coordination 

(ataxia), retinitis pigmentosa (a progressive vision disorder), and bone and 
skin changes. Refsum disease is characterized by an accumulation of 
phytanic acid in the plasma and tissues. Is a derivative of phytol, a 
component of chlorophyll. In 1997 the gene for Refsum disease was 

30 identified and mapped to chromosome 10 (Jansen et al., Nat Genet, 

17(2) : 190-3 (1997); and Mihalik et al., Nat Genet, 17(2) :185-9 (1997)). 
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The protein product of the gene, PAHX, is an enzyme that is required for 
the metabolism of phytanic acid. Refsum disease patients have impaired 
PAHX - phytanic acid hydrolase. 

In a specific embodiment, the Refsum disease to be predicted or 
5 diagnosed according to the present method is associated with a mutation 
in PAHX. 

j. Tangier disease 
Tangier disease (TD) is a genetic disorder of cholesterol transport 
named for the secluded island of Tangier, located off. the coast of 

10 Virginia. TD was first identified In a five-year-old inhabitant of the island 
who had characteristic orange tonsils, very low levels of high density 
lipoprotein (HDL) or 'good cholesterol', and an enlarged liver and spleen. 
TD is caused by mutations in the ABC1 (ATP-binding cassette) gene on 
chromosome 9q31 (Rust et al., A/af. Genet. 22(4) :352-5 (1999); 

15 Bodzioch et al., Nat. Genet., 22(4) :347-51 (1999); Brooks-Wilson et al., 
Nat Genet, 22(4) :336-45 (1999); and Rust et al., Nat Genet] 
20(11 :96-8 (1998)). ABC1 codes for a protein that helps rid cells of 
excess cholesterol. This cholesterol is then picked up by HDL particles in 
the blood and carried to the liver, which processes the cholesterol to be 

20 reused in cells throughout the body. Individuals with TD are unable to 
eliminate cholesterol from cells, leading to its buildup in the tonsils and 
other organs. 

In a specific embodiment, the Tangier disease to be predicted or 
diagnosed according to the present method is associated with a mutation 
25 in ABC1 (ATP-binding cassette) gene on chromosome 9q31 • 
4. Muscle and bone diseases and disorders 
Any muscle and bone diseases or disorders that are associated 
with a mutation(s) in a nucleic acid can be predicted or diagnosed using 
the present methods. For example, Duchenne muscular dystrophy 
30 (DMD), ELLIS-VAN CREVELD syndrome (chondroectodermal dysplasia), 
Marfan syndrome and myotonic dystrophy can be predicted or diagnosed 
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using the present methods. 

a. Duchenne muscular dystrophy 

Duchenne muscular dystrophy (DMD) is one of a group of muscular 
dystrophies characterized by the enlargement of muscles. The gene for 
5 DMD, found on the X chromosome, encodes a large protein - dystrophin 
(Koenig et a!.. Cell, 53(2) :219-26 (1988)). Dystrophin is required inside 
muscle cells for structural support: it is thought to strengthen muscle 
cells by anchoring elements of the internal cytoskeleton to the surface 
membrane. Without it, the cell membrane becomes permeable, so that 
10 extracellular components enter the cell, increasing the internal pressure 
until the muscle cell 'explodes' and dies. The subsequent immune 
response can add to the damage. 

In a specific embodiment, the Duchenne muscular dystrophy to be 
predicted or diagnosed according to the present method is associated 
1 5 with a mutation in dystrophin, 

b. Ellis-Van Ore veld syndrome 

Ellis-Van Creveld syndrome, also known as 'chondroectodermal 
dysplasia', is a rare genetic disorder characterized by short-limb dwarfism, 
Polydactyly (additional fingers or toes), malformation of the bones of the 

20 wrist, dystrophy of the fingernails, partial hare-lip, cardiac malformation 
and often prenatal eruption of the teeth. The gene causing Ellis-van 
Creveld syndrome, EVC, has been mapped to the short arm of 
chromosome 4 (Polymeropoulos et al.. Genomics, 35(1) :1-5 (1996)). A 
pattern of inheritance can be observed that has indicated the disease is 

25 autosomal-recessive (i.e., a mutated gene form both parents is required 
before the effects of the disease to become apparent). 

In a specific embodiment, the Ellis-Van Creveld syndrome to be 
predicted or diagnosed according to the present method is associated 
with a mutation in EVC gene. 

30 c. Marfan syndrome 

Marfan syndrome is a connective tissue disorder, so affects many 
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structures, including the skeleton, lungs, eyes, heart and blood vessels. 
The disease is characterized by unusually long limbs. Marfan syndrome is 
an autosomal dominant disorder that has been linked to the FBN1 gene 
on chromosome 15 (Dietz et al., Nature, 352(6333) :337"9 (1991); and 
5 Kainulainen et al., N. Engl, J. Med.\ 323(1 4) :935-9 (1990)). FBN1 
encodes a protein called fibrillin, which is essential for the formation of 
elastic fibers found in connective tissue. Without the structural support 
provided by fibrillin, many tissues are weakened, which can have severe 
consequences, for example, ruptures in the walls of major arteries, 
10 In a specific embodiment, the Marfan syndrome to be predicted or 

diagnosed according to the present method is associated with a mutation 
inFBNI. 

d. Myotonic dystrophy 
Myotonic dystrophy is an inherited disorder in which the muscles 

1 5 contract but have decreasing power to relax. With this condition, the 
muscles also become weak and waste away. Myotonic dystrophy can 
cause mental deficiency, hair loss and cataracts. Onset of this rare 
disorder commonly occurs during young adulthood. It can occur at any 
age and is extremely variable in degree of severity. The myotonic 

20 dystrophy gene, found on chromosome 1 9, codes for a protein kinase 
that is found in skeletal muscle, where it likely plays a regulatory role 
(Aslanidis et aL, Nature, 355(6360) :548-51 (1992)). An unusual feature 
of this illness is that its symptoms usually become more severe with each 
successive generation. This is because mistakes in the faithful copying of 

25 the gene from one generation to the next result In the amplification of a 
'AGC triplet repeat', similar to that found in Huntington disease. 
Unaffected individuals have between 5 and 27 copies of AGC, myotonic 
dystrophy patients who are minimally affected have at least 50 repeats, 
while more severely affected patients have an expansion of up to several 

30 kilobase pairs. 

In a specific embodiment, the myotonic dystrophy to be predicted 
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or diagnosed according to the present method is associated with a 
mutation in myotonic dystrophy gene. 

5. Nervous system diseases and disorders 
Any nervous system diseases and disorders that are associated 
5 with a mutation(s) in a nucleic acid can be predicted or diagnosed using 
the present methods. For example, Alzheimer disease (AD), amyotrophic 
lateral sclerosis (ALS), Angelman syndrome (AS), Chareot-Marle-tooth 
disease (CMT), epilepsy, tremor, fragile X syndrome, Friedreich's ataxia 
(FRDA), Huntington disease (HD), Niemann-Pick, Parkinson disease, . 

10 Prader-Willi syndrome (PWS), spinocerebellar atrophy and Williams 
syndrome can be predicted or diagnosed using the present methods, 
a. Alzheimer's Disease 
Alzheimer' Disease (AD) is the fourth leading cause of death in 
adults. The incidence of the disease rises steeply with age. Some of the 

15 most frequently observed symptoms of the disease include a progressive 
inability to remember facts and events and, later, to recognize friends and 
family. Certain types of AD run in families: currently, mutations in four 
genes, situated on chromosomes 1, 14, 19 and 21, are believed to play a 
role in the disease. The best-characterized of these are PS1 (or AD3) on 

20 chromosome 14 and PS2 (or AD4) on chromosome 1 (Levy-Lahad et al., 
Science, 269(52261 :973-7 (1995); and Sherrington et al.. Nature, 
375(6534) :754-60 (1995)). The formation of lesions made of 
fragmented brain cells surrounded by amyloid-family proteins are 
characteristic of the disease. These lesions and their associated proteins 

25 are closely related to similar structures found in Down's Syndrome. 
Tangles of filaments largely made up of a protein associated with the 
cytoskeleton have also been observed in samples taken from Alzheimer 
brain tissue. 

In a specific embodiment, the Alzheimer disease to be predicted or 
30 diagnosed according to the present method is associated with a mutation 
in the AD1 , AD2, AD3 or AD4 gene. 
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b. Amyotrophic lateral sclerosis 

Amyotrophic lateral sclerosis (ALS) is a neurological disorder 
characterized by progressive degeneration of motor neuron cells in the 
spinal cord and brain, which ultimately results in paralysis and death. The 
5 S0D1 gene was identified as being associated with many cases of 
familial ALS (Rosen et al., Nature, 362(64151 :59-62 (1993)). The 
enzyme coded for by S0D1 carries out a very important function in cells: 
it removes dangerous superoxide radicals by converting them into non- 
harmful substances. Defects in the action of this enzyme mean that the 
10 superoxide radicals attack cells from the inside, causing their death. 
Several different mutations in this enzyme all result in ALS, making the 
exact molecular cause of the disease difficult to ascertain. 

In a specific embodiment, the amyotrophic lateral sclerosis to be 
predicted or diagnosed aqcording to the present method is associated 
15 with a mutation in S0D1 . 

c. Angelman syndrome 

Angelman syndrome (AS) is an uncommon neurogenetic disorder 
characterized by mental retardation, abnormal gait, speech impairment, 
seizures, and an inappropriate happy demeanor which includes frequent 

20 laughing, smiling, and excitability. The genetic basis of AS is very 
complex, but the majority of cases are due to a deletion of segment 
1 5q1 1 -q1 3 on the maternally derived chromosome 1 5. When this same 
region is missing from the paternally derived chromosome, an entirely 
different disorder, Prader-Willi syndrome, results. This phenomenon - 

25 when the expression of genetic material depends on whether it has been 
. inherited from the mother or the father - is termed genomic imprinting. 
The ubiquitin ligase gene (U6E3A) is found in the AS chromosomal region 
(Jiang et al.. Am. J. Hum. Genet, 65(1) :1-6 (1999); Albrecht et al., Nat 
Genet, 17(1) :75-8 (1997); and Kishino et al., Nat Genet, 15(1) :70-3 

30 (1 997)). It codes for an enzyme that is a key part of a cellular protein 
degradation system. AS is thought to occur when mutations in UBE3A 
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disrupt protein break down during brain development. 

In a specific embodiment, the Angelman syndrome to be predicted 
or diagnosed according to the present method is associated with a 
mutation in ubiquitin ligase gene (UBE3A). 
5 d. Charcot-Marle-tooth disease 

Charcot-Marle-tooth disease (CMT) disease is characterized by a 
slowly progressive degeneration of the muscles in the foot, lower leg, 
hand and forearm, and a mild loss of sensation in the limbs, fingers and 
toes. CMT is a genetically heterogeneous disorder, in which mutations in 

10 different genes can produce the same clinical symptoms (Lagemann, 
ROFO Fortschr Geb Rontgenstr Nuklearmed, 124(1) ;69-75 (1976); and 
Hayasaka et al,, Genomics, 17(3) :755-8 (1993)). In CMT, there are not 
only different genes but different patterns of inheritance. One of the 
most common forms of CMT is Type 1 A. The gene for Type 1A CMT 

1 5 maps to chromosome 17 and Is thought to code for a protein (PMP22) 
involved in coating peripheral nerves with myelin, a fatty sheath that is 
important for their conductance. Other types of CMT include Type 1 B, 
autosomal-recessive and X-linked. The same proteins involved in the 
Type 1A and Type IB CMT are also involved in a disease called Dejerine- 

20 Sottas syndrome (DSS), in which similar clinical symptoms are presented, 
but they are more severe. 

In a specific embodiment, the Charcot-Marle-tooth disease to be 
predicted or diagnosed according to the present method is associated 
with a mutation in type 1 A or type 1 B CMT gene. 

25 e. Epilepsy 

Epilepsy is characterized by recurring seizures resulting from 
abnormal cell firing in the brain. There are many forms of epilepsy - most 
are rare. To date, twelve forms of epilepsy have been demonstrated to 
possess some genetic basis. For example, LaFora Disease (progressive 

30 myoclonic, type 2) is a particularly aggressive epilepsy inherited in an 
autosomal recessive fashion (Minassian et al., Nat Genet, 20(2) :171"4 
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(1 998)). LaFora Disease is thought to result from a mutation in the 
EPM2A gene, which is located on chromosome 6. This gene is thought 
to produce laforin, a protein similar to a group of protein-tyrosine 
phosphatases that help maintain a balance of sugars in the blood stream, 
5 Too much laforin may destroy brain cells, which may then lead to the 
development of LaFora Disease. 

In a specific embodiment, the epilepsy to be predicted or diagnosed 
according to the present method is associated with a mutation in EPM2A. 
f. Tremor 

10 Tremor, or uncontrollable shaking, is a common symptom of 

neurological disorders such as Parkinson's disease, head trauma and 
stroke. Many people with tremor have what is called idiopathic or 
essential tremor. In these cases, the tremor itself is the only symptom of 
the disorder. While essential tremor may involve other parts of the body, ' 

1 5 the hands and head are most often affected. In more than half of cases, 
essential tremor is inherited as an autosomal dominant trait, which means 
that children of an affected individual will have a 50 percent chance of 
also developing the disorder. In 1997, the ETM1 gene (also called FET1) 
was mapped to chromosome 3 in a study of Icelandic families, while 

20 another gene, called ETM2, was mapped to chromosome 2 in a large 
American family of Czech descent (Gulcher et al., Nat Genet, 17(1) :84- 
7 (1997)). That two genes for essential tremor have been found on two 
different chromosomes demonstrates that mutations in a variety of genes 
may lead to essential tremor. 

25 In a specific embodiment, the tremor to be predicted or diagnosed, 

according to the present method is associated with a mutation in ETM1 
or ETM2. 

g. Fragile X syndrome 

Fragile X syndrome is the most common inherited form of mental 
30 retardation currently known. Fragile X syndrome is a defect in the X 
chromosome and its effects are seen more frequently, and with greater 
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severity, in males than females. In normal individuals, the FMR1 gene is 
transmitted stably from parent to child. In Fragile X individuals, there is a 
mutation in one end of the gene (the 5' untranslated region), that 
involves amplification of a CGG repeat {Siomi et al., Cell, 74(2) :291-8 
5 (1993)). Patients with fragile X syndrome have 200 or more copies of 
the CGG motif. The huge expansion of this repeat means that the FMR1 
gene is not expressed, so no FMR1 protein is made. Although the exact 
function of FMR1 protein in the cell is unclear, it is known that it binds 
RNA. 

10 In a specific embodiment, the fragile X syndrome to be predicted or 

diagnosed according to the present nriethod is associated with a mutation 
in FMR1 gene. 

h. Friedreich's ataxia 

Friedreich's ataxia (FRDA) is a rare Inherited disease characterized 
1 5 by the progressive loss of voluntary muscular coordination (ataxia) and 
heart enlargement. FRDA is an autosomal recessive disease caused by a 
mutation of a gene called frataxin, which is located on chromosome 9 
(Campuzano et al.. Science, 271 (5254) :1423-7 (1996); and Babcock et 
al.. Science, 276(531 9) ;1709-12 (1997)). This mutation means that 
20 there are many extra copies of a DNA segment, the trinucleotide GAA. A 
normal individual has 8 to 30 copies of this trinucleotide, while FRDA 
patients have as many as 1000. The larger the number of GAA copies, 
the earlier the onset of the disease and the quicker the decline of the 
patient. 

25 . In a specific embodiment, the Friedreich's ataxia to be predicted or 
diagnosed according to the present method is associated with a mutation 
in frataxin. 

i. Huntington disease 

Huntington disease (HD) is an inherited, degenerative neurological 
30 disease that leads to dementia. The HD gene, whose mutation results in 
Huntington disease, was mapped to chromosome 4 in 1 983 and cloned in 
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1993 (Cell, 72(6) :971-83 (1993)). The mutation is a characteristic 
expansion of a nucleotide triplet repeat in the DNA that codes for the 
protein hunting tin. The number of repeated triplets - CAG (cytosine, 
adenine, guanine) - increases with the age of the patient. Since people 
5 who have those repeats always suffer from Huntington disease, it 

suggests that the mutation causes a gain-of-function, in which the mRNA 
or protein takes on a new property or is expressed inappropriately. 

In a specific embodiment, the Huntington disease to be predicted 
or diagnosed according to the present method is associated with a 

10 mutation in the HD gene. 

j. Niemann-Pick's disease 
In 1914, German Pediatrician Albert Niemann described a young 
child with brain and nervous system impairment. Later, in the 1 920's, 
Luddwick Pick studied tissues after the death of such children and 

1 5 provided evidence of a new disorder, distinct from those storage 

disorders previously described. Today, there are three separate diseases 
that carry the name Niemann-Pick: Type A is the acute infantile form. 
Type B is a less common, chronic, non-neurological form, while Type C is 
a biochemically and genetically distinct form of the disease. Recently, 

20 the major locus responsible for Niemann-Pick type C (NP-C) was cloned 
from chromosome 18, and found to be similar to proteins that play a role 
in cholesterol homeostasis (Carstea, Science, 277(5323) :228-31 (1997); 
and Loftus, Science, 277(5323) :232-5 (1997)). Usually, cellular 
cholesterol is imported into lysosomes - 'bags of enzymes' in the cell - for 

25 processing, after which it is released. Cells taken from NP-C patients 

have been shown to be defective in releasing cholesterol from lysosomes. 
This leads to an excessive build-up of cholesterol inside lysosomes, 
causing processing errors. NPC1 was found to have known sterol- 
sensing regions similar to those in other proteins, which suggests it plays 

30 a role in regulating cholesterol traffic. 

In a specific embodiment, the Niemann-Pick to be predicted or 
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diagnosed according to tlie present method is associated witli a mutation 
inNPCI. 

k. Parkinson disease 

Parkinson disease is a neurodegenerative disease that manifests as 
5 a tremor, muscular stiffness and difficulty with balance and walking. A 
classic pathological feature of the disease is the presence of an inclusion 
body, called the Lewy body, in many regions of the brain. A candidate 
gene for some cases of Parkinson disease was mapped to chromosome 4 
(Polymeropoulos et al.. Science, 276(5321 ) :2045-7 (1997)). IVIutations 
10 in this gene have now been linked to several Parkinson disease families. 
The product of this gene, a protein called alpha-synuclein, is a familiar 
culprit: a fragment of it is a known constituent of Alzheimer disease 
plaques.. 

In a specific embodiment, the Parkinson disease to be predicted or * 
1 5 diagnosed according to the present method is associated with a mutation 
in GT-synuclein. 

i. Spinocerebellar atrophy 
Persons with spinocerebellar atrophy, of which there are several 
types, experience a degeneration of the spinal cord and the cerebellum, 
20 the small fissured mass at the base of the brain, behind the brain stem. 
The cerebellum is concerned with coordination of movements, so atrophy 
or "wasting away" of this critical control center results in a loss of 
muscle coordination. Atrophy in the spine can bring spasticity. The 
basic defect in all types of spinocerebellar atrophy is a an expansion of a 
25 CAG triplet repeat. In this way, it is similar to fragile-X syndrome, 

Huntington disease and myotonic dystrophy, all of which exhibit a triplet 
repeat expansion of a gene. In the case of spinocerebellar atrophy I, the 
gene is SCA1, found on chromosome 6 (Banfi et al., Nat. Genet, 
7 (4) :5 13-20 (1994)). The protein product of the gene - called ataxin-1 - 
30 varies in size, depending on the size of the CAG triplet repeat. 

In a specific embodiment, the Prader-Willi syndrome to be predicted 

92- 



wo 01/62968 PCT/USOl/00452 

or diagnosed according to the present method is associated with a 
mutation in the small ribonucleoprotein N (SNRPN). 
m. Williams syndrome 
Williams syndrome Is a rare congenital disorder characterized by 
5 physical and development problems. Common features include 

characteristic "elfin-like" facial features, heart and blood vessel problems, 
irritability during infancy, dental and kidney abnormalities, hyperacusis 
(sensitive hearing) and musculoskeletal problems. In Williams syndrome 
individuals, the gene for.elastin and an enzyme called LIM kinase are 

10 deleted (Frangiskakis et al.. Cell, 86(1) :59-69 (1996); and Lenhoff et al., 
ScL Am., 277(6) :68-73 (1997)), Both genes map to the same small area 
on chromosome 7. In normal cells, elastin is a key component of 
connective tissue, conferring its elastic properties. Mutation or deletion 
of elastin lead to the vascular disease observed in Williams syndrome. On ' 

15 the other hand, LIM kinase is strongly expressed in the brain, and deletion 
of LIM kinase is thought to account for the impaired visuospatial 
constructive cognition in Williams syndrome. Williams syndrome is a 
contiguous disease, meaning that the deletion of this section of 
chromosome 7 may involve several more genes. Further study will be 

20 required to round up all the genes deleted in this disease. 

In a specific embodiment, the Williams syndrome to be predicted or 
diagnosed according to the present method is associated with a mutation 
in elastin and LIM kinase. 

6. Signal disease or disorder 

25 Any signal diseases or disorders that are associated with a 

mutation(s) in a nucleic acid can be predicted or diagnosed using the 
present methods. For example, ataxia telangiectasia (A-T), male pattern 
baldness, acne, hirsutism, Cockayne syndrome, glaucoma, mammals with 
abnormal secondary sexual characteristics, tuberous sclerosis, 

30 Waardenburg syndrome (WS) and Werner syndrome (WRN) can be 
predicted or diagnosed using the present methods. 
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a. Ataxia telangiectasia 

The first signs of ataxia telangiectasia (A-T) usually appear in the 
second year of life as a lack of balance and slurred speech. It is a 
progressive, degenerative disease characterized by cerebellar 
5 degeneration, immunodeficiency, radiosensitivity (sensitivity to radiant 
energy, such as x-ray) and a predisposition to cancer. The gene 
responsible for A-T was mapped to chromosome 1 1 . The subsequent 
identification of the gene proved difficult: it was seven more years until 
the human ATM gene was cloned (Savitsky, Science, 268(521 8): 1 749- 

10 53 (1995); and Barlow Cell, 86(1) :159-71 (1996)). The diverse 

symptoms seen in A-T reflect the main role of ATM, which is to induce 
several cellular responses to DNA damage. When the ATM gene is 
mutated, these signaling networks are impaired and so the cell does not 
respond correctly to minimize the damage. 

15 In a specific embodiment, the ataxia telangiectasia to be predicted 

or diagnosed according to the present method is associated with a 
mutation in ATM. 

b. Male pattern baldness, acne or hirsutism 

20 Five-a reductase is an enzyme that was first discovered in the male 

prostate. Here, it catalyzes the conversion of testosterone to 
dihydrotestosterone, which in turn binds to the androgen receptor and 
initiates development of the external genitalia and prostate. The gene for 
5-alpha reductase has been mapped to chromosome 5 (Andersson and 

25 Russell, Proc. Natl. Acad. Set., 871101:3640-4 (1990); and Jenkins 
Genomics, 11(4) ;1102-12 (1991)). More recently, 5-alpha reductase 
was found in human scalp and elsewhere in the skin, where it carries out 
the same reaction as in the prostate. It is thought that disturbances in 5- 
alpha reductase activity in skin cells might contribute to male pattern 

30 baldness^ acne or hirsutism. 

In a specific embodiment, the male pattern baldness, acne or 
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hirsutism to be predicted or diagnosed according to tiie present method is 
associated with a mutation in 5-a reductase. 

c. Cockayne syndrome 

Cockayne syndrome is a rare inherited disorder in which people are 
5 sensitive to sunlight, have short stature and have the appearance of 
premature aging. In the classical form of Cockayne syndrome (Type I), 
the symptomis are progressive and typically become apparent after the 
age of one year. An early onset or congenital form of Cockayne 
syndrome (Type II) is apparent at birth* Interestingly, unlike other DNA 

10 repair diseases, Cockayne syndrome Is not linked to cancer. After 
exposure to UV radiation (found in sunlight), people with Cockayne 
syndrome can no longer perform a certain type of DNA repair, known as 
'transcription-coupled repair'. This type of DNA repair occurs 'on the fly', 
right as the DNA that codes for proteins is being replicated. Two genes 

1 5 defective in Cockayne syndrome, CSA and CSB, have been identified so 
far. The CSA gene is found on chromosome 5. Both genes code for 
proteins that interacts with components of the transcriptional machinery 
and with DNA repair proteins (van Gool, EMBO J., 16(14) :4155-62 
(1997)). ' • 

20 In a specific embodiment, the Cockayne syndrome to be predicted 

or diagnosed according to the present method is associated with a 
mutation in CSA or CSB. 

d. Glaucoma 

Glaucoma is a term used for a group of diseases that can lead to 
25 damage to the eye's optic nerve and result in blindness. The most 

common form of the disease is open-angle glaucoma, which affects about 
three million Americans, half of whom don't know they have it. 
Glaucoma has no symptoms at first but over the years can steal its 
victims' sight, with side vision being effected first. It is estimated that 
30 nearly 100,000 individuals in the US suffer from glaucoma due to a 

mutation in the GLC1A gene, found on chromosome 1 (Stone, Science, 
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275(5300) :668-70 (1997)). There has been some speculation as to the 
role of the gene product in the eye. As it is found in the structures of the 
eye involved in pressure regulation, it may cause increased pressure in 
the eye by obstructing the aqueous outflow. 
5 In a specific embodiment the glaucoma to be predicted or 

diagnosed according to the present method is associated with a mutation 
inGLClA. 

e. Abnormal secondary sexual characteristics 

Usually, a woman has two X chromosomes (XX) and a man one X 
10 and one Y (XY). Male and female characteristics sometimes can be found 
in one individual, and it is possible to have XY women and XX men. 
Analysis of such individuals has revealed some of the molecules involved 
in sex determination, including one called SRY, which is important for 
testis formation. SRY (which stands for sex-determining region Y gene) 
15 is found on the Y chromosome IBerta, Nature, 348(6300) :448"50 

(1990); and Goodfellow and Lovell-Badge, Annu. Rev. Genet., 27:71-92 
(1993)). In the cell, it binds to DNA and in doing so distorts it 
dramatically out of shape. This alters the properties of the DNA and 
likely alters the expression of a number of genes, leading to testis 
20 formation. Therefore XX men who lack a Y chromosome also lack SRY 
and frequently do not develop secondary sexua) characteristics in the 
usual way. 

In a specific embodiment, the abnormal secondary sexual 
characteristics to be predicted or diagnosed according to the present 
25' method is associated with a mutation in sex-determining region Y gene 
(SRY). 

f . Tuberous sclerosis 

Tuberous sclerosis is an hereditary disorder characterized by 
benign, tumor-like nodules of the brain and/or retinas, skin lesions, 
30 seizures and/or mental retardation. Patients may experience a few or all 
of the symptoms with varying degrees of severity. Two loci for tuberous 
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sclerosis have been found: TSC1 on chromosome 9, and TSC2 on 
chromosome 16 {Cell, 75(7) :1305"15 (1993)). It took four years to pin 
down a specific gene from the TSC1 region of chromosome 9: in 1997, a 
promising candidate was found. Called hamartin by the discoverers, it is 
5 similar to a yeast protein of unknown function, and appears to act as a 
tumor suppressor: without TSC1, growth of cells proceeds in an 
unregulated fashion, resulting in tumor formation (van Slegtenhorst, 
Science, 277(5327) :805'8 (1997)). TSC2 codes for a protein called 
tuberin, which, through database searches, was found to have a region of 

1 0 homology to a protein found in pathways that regulate the cell (GAP3, a 
GTPase-activation. protein). 

in a specific embodiment, the tuberous sclerosis to be predicted or 
diagnosed according to. the present method is associated with a mutation 
in TSC1 or TSC2. 

15 g- Waardenburg syndrome 

The main characteristics of Waardenburg syndrome (WS) include: a 
wide bridge of the nose; pigmentary disturbances such as two different 
colored eyes, white forelock and eyelashes and premature graying of the 
hair; and some degree of cochlear deafness. The several types of WS are 

20 inherited in dominant fashion, so researchers typically see families with 
several generations who have inherited one or more of the features. 
Type I of the disorder is characterized by displacement of the fold of the 
eyelid, while Type II does not include this feature, but instead has a 
higher frequency of deafness. The discovery of the human gene that 

25 causes Type I WS came about after scientists speculated that the gene 
that causes 'splotch mice' (mice with a splotchy coat coloring) might be 
the same gene that causes WS in humans. They located the human gene 
to chromosome 2 and found it was the same as mouse PaxS (Tassabehji 
et al., Nature, 355(6361): 635-6 (1992)). 

30 In a specific embodiment, the Waardenburg syndrome to be 

predicted or diagnosed according to the present method is associated 
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with a mutation in human homolog of mouse Pax3. 
h. Werner syndrome 
Werner syndrome is a premature aging disease that begins in 
adolescence or early adulthood and results in the appearance of old age 
5 by 30-40 years of age. Its physical characteristics may include short 
stature (common from childhood on) and other features usually 
developing during adulthood: wrinkled skin, baldness, cataracts, muscular 
atrophy and a tendency to diabetes mellitus, among others. The disorder 
is inherited and transmitted as an autosomal recessive trait. Cells from 

10 WS patients have a shorter lifespan in culture than do normal cells. The 
gene for Werner disease (WRN) was mapped to chromosome 8 and 
cloned: by comparing its sequence to existing sequences in GenBank, it is 
a predicted helicase belonging to the RecQ family (Gray et al., Nat 
Genet, 17(1) :100-3 (1997); and Sinclair et al.. Science, 

15 277153301:1313-6 (1997)). 

In a specific embodiment, the Werner syndrome to be predicted or 
diagnosed according to the present method is associated with a mutation 
An WRN gene. 

7. Transporter diseases and disorders 

20 Any transporter diseases and disorders that are associated with a 

mutation(s) in a nucleic acid can be predicted or diagnosed using the 
present methods.. For example, cystic fibrosis (CF), diastrophic dysplasia 
(DTD), long-QT syndrome (LQTS), Menkes' syndrome, pendred 
syndrome, adult polycystic kidney disease (APKD), Wilson's disease and 

25 Zellweger syndrome can be predicted or diagnosed using the present 
methods. 

a. Cystic fibrosis 
Cystic fibrosis (CF) is the most common fatal genetic disease in the 
US today. It causes the body to produce a thick, sticky mucus that clogs 
30 the lungs, leading to infection, and blocks the pancreas, stopping 

digestive enzymes from reaching the intestines where they are required to 
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digest food. CF is caused by a defective gene, which codes for a sodium 
and chloride (salt) transporter found on the surface of the epithelial cells 
that line the lungs and other organs (Riordan et al., Science, 
245149221:1066-73 (1989)). Several hundred mutations have been 
5 found in this gene, all of which result in defective transport of sodium 
and chloride by epithelial cells. The severity of the disease symptoms of 
CF is directly related to the characteristic effects of the particular 
mutation(s) that have been inherited by the sufferer. 

In a specific embodiment, the cystic fibrosis to be predicted or 
10 diagnosed according to the present method is associated with a mutation 
in the CF gene. 

b. Diastrophic dysplasia 

Diastrophic dysplasia (DTD) is a rare growth disorder in which 
patients are usually short, have club feet and have malformed hands and 

1 5 joints. Although found in all populations, it is particularly prevalent in 
Finland. The gene whose mutation results in DTD maps to chromosome 
5 and encodes a novel sulfate transporter (Hastbacka et al.. Genomics, 
11(4) :968-73 (1991); and Hastbacka et al., CeH, 78(6) :1073-87 (1994)). 
This ties in with the observation of unusual concentrations of sulfate in 

20 various tissues of DTD patients. Sulfate is important for skeletal joints 
because cartilage - the shpck-absorber of joints - requires sulfur during its 
manufacture. Adding sulfur increases the negative charge within 
cartilage, which contributes to its shock-absorbing properties. 

In a specific embodiment, the diastrophic dysplasia to be predicted 

25 or diagnosed according to the present method is associated with a 
mutation in the DTD gene. 

c. Long-QT syndrome 

Long-QT syndrome (LQTS) results from structural abnormalities in 
the potassium channels of the heart, which predispose affected persons 
30 to an accelerated heart rhythm (arrhythmia). This can lead to sudden loss 
of consciousness and may cause sudden cardiac death in teenagers and 
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young adults who are faced with stressors ranging from exercise to loud 
sounds, LQTS is usually inherited as an autosomal dominant trait (Wang 
et al., Nat. Genet, 12(1) : 17-23 (1996); and Barhanin et al.. Nature, 
384(66Q4) :78-80 (1996)). In the case of LQT1, which has been mapped 
5 . to chromosome 1 1 , mutations lead to serious structural defects in the 
person's cardiac potassium channels that do not allow proper 
transmission of the electrical impulses throughout the heart. There also 
appear to be other genes, tentatively located on chromosomes 3, 6 and 
1 1 whose mutated products may contribute to, or cause, LQT syndrome. 
10 In a specific embodiment, the lohg-QT syndrome to be predicted or 

diagnosed according to the present method is associated with a mutation 
in LQT1 . 

d. Menkes' syndrome 
Menkes' syndrome is an inborn error of metabolism that markedly 

15 decreases the cells' ability to absorb copper. The disorder causes severe 
cerebral degeneration and arterial changes, resulting in death in infancy. 
The disease can often be diagnosed by looking at a victim's hair, which 
appears to be whitish and kinked when viewed under a microscope. 
Menkes' disease is transmitted as an X-linked recessive trait. Sufferers 

20 can not transport copper, which is needed by enzymes involved in making 
bone, nerve and other structures (Chelly et al., Nat. Genet., 3(1) :14"9 
(1993)). A number of other diseases, including type IX Ehlers-Danlos 
syndrome, may be the result of allelic mutations {i.e., mutations in the 
same gene, but having slightly different symptoms) and it is hoped that 

25 research into these diseases may prove useful in fighting Menkes' 
disease. 

In a specific embodiment, the Menkes' syndrome to be predicted or 
diagnosed according to the present method is associated with a mutation 
in the copper transporter. 
30 e. Pendred syndrome 

Pendred syndrome is an inherited disorder that accounts for as 
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much as 10% of hereditary deafness. Patients usually also suffer fronn 
thyroid goiter. In December of 1997, scientists at NIH's National Human 
Genome Research Institute used the physical map of human chromosome 
7 to help identify an altered gene thought to cause pendred syndrome 
5 (Everett et al., Nat. Genet., 17(41 :411 -22 (1997)). The normal gene 
makes a protein, called pendrin, that is found at significant levels only in 
the thyroid and is closely related to a number of sulfate transporters. 
When the gene for this protein is mutated, the person carrying it will 
exhibit the symptoms of Pendred syndrome. 
10 In a specific embodiment, the pendred syndrome to be predicted or 

diagnosed according to the present method is associated with a mutation 
in pendrin. 

f . Adult polycystic Icidney disease 
Adult polycystic kidney disease (APKD) is characterized by large 

1 5 cysts in one or both kidrieys and a gradual loss of normal kidney tissue. 
The role of the kidneys in the body is to filter the blood, excreting the 
end-products of metabolism in the form of urine and regulating the 
concentrations of hydrogen, sodium, potassium, phosphate and other 
ions in the extracellular fluid. Patients with APKD can die from renal 

20 failure, or from the consequences of hypertension (high arterial blood 
pressure). In 1994 the European Polycystic Kidney Disease Consortium 
isolated a gene from chromosome 16 that was disrupted in a family with 
APCD {Cell, 77161:881-94 (1994) (Published errata appear in Cell 1994 
Aug 26;78(4):following 724 and 1995 Jun 30;81(7):following 1170); 

25 and Cell, 81121:289-98 (1995)). The protein encoded by the PKD1 gene 
is an integral membrane protein involved in cell-cell interactions and cell- 
matrix interactions. The role of PKD1 in the normal cell may be linked to. 
microtubule-mediated functions, such as the placement of Na( + ), K( + )' 
ATPase ion pumps in the membrane. Programmed cell death, or 

30 apoptosis, may also be invoked in APKD. 

In a specific embodiment, the adult polycystic kidney disease to be 
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predicted or diagnosed according to the present method is associated 
with a mutation in PKD1 . 

g. Wilson's disease 

Wilson's disease is a rare autosomal recessive disorder of copper 
5 transport, resulting in copper accumulation and toxicity to the liver and 
brain. Liver disease is the most common symptom in children; 
neurological disease is most common in young adults. The cornea of the 
eye can also be affected: the 'Kayser-Fleischer ring' is a deep copper- 
colored ring at the periphery of the cornea, and is thought to represent 

10 copper deposits. The gene for Wilson's disease (ATP7B) was mapped to 
chromosome 1 3. The sequence of the gene was found to be similar to 
sections of the genei defective in Menkes disease, another disease caused 
by defects in copper transport. The similar sequences code for copper- 
binding regions, which are part of a transmembrane pump called a P-type - 

1 5 ATPase that is very similar to the Menkes disease protein (Bull et aL, Nat. 
Genet., 5i41.:327-37 (1993) (Published erratum appears in Nat Genet 
1994 Feb;6(2):214). 

In a specific embodiment, the Wilson's disease to be predicted or 
diagnosed according to the present method is associated with a mutation 

20 in ATP7B. 

h. Zellweger syndrome 

Zellweger syndrome is a rare hereditary disorder affecting infants, 
and usually results in death. Unusual problems in prenatal development, 
an enlarged liver, high levels of iron and copper in the blood, and vision 

25 disturbances are among the major manifestations of Zellweger syndrome. 
The PXR1 gene has been mapped to chromosome 12; mutations in this 
gene cause Zellweger syndrome. The PXR1 gene product is a receptor 
found on the surface of peroxisomes - microbodies found in animal cells, 
especially liver, kidney and brain cells (Dodt et al., Nat. Genet., 9(2) :1 15- 

30 25 (1995); and Marynen et al., Genomics, 30121:366-8 (1995)). The 
PXR1 receptor is vital for the import of these enzymes into the 
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peroxisomes: without it functioning properly, the peroxisomes can not 
use the enzymes to carry out their important functions, such as cellular 
lipid metabolism and metabolic oxidations. 

In a specific embodiment, the Zellweger syndrome to be predicted 
5 or diagnosed according to the present method is associated with .a 
mutation in PXR1 . 

8. Infections 

Any infections by pathological agents can be predicted or 
diagnosed using the present methods. For example, infections by 
10 viruses, eubacteria, archaebacteria and eukaryotic pathogens can be 
predicted or diagnosed using the present methods. 

In a specific embodiment, the viral infection to be predicted or 
diagnosed according to the present method is caused by a Delta virus, a 
dsDNA virus, a retroid virus, a satellite virus, a ssDNA virus, a ssRNA 
1 5 negative-strand virus, ssRNA positive-strand virus (no DNA stage) or a 
bacteriophage. 

In another specific embodiment, the eubacteria infection to be 
predicted or diagnosed according to the present method is caused by a 
green bacteria, a flavobacteria, a spirochetes, a purple bacteria, a gram- 

20 positive bacteria, a gram-negative bacteria, a cynobacteria, a deinococci 
or a thermotogale. 

In still another specific embodiment, the archaebacteria infection to 
be predicted or diagnosed according to the present method is caused by 
an extreme halophile, a methanogen or an extreme thermophile. 

25 In yet another specific embodiment, the infection to be predicted or 

diagnosed according to the present method is caused by an eukaryotic 
pathogen such as a fungi, a ciliate, a cellular slime mode, a flagellate or a 
microsporidia. 

D. IVIETHODS FOR DETECTING POLYMORPHISMS 

30 Provided herein is a method for detecting polymorphism in a locus, 

which method comprises: a) hybridizing a target strand of a nucleic acid 
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comprising a locus to be tested with a complementary reference strand of 
a nucleic acid comprising a known allele of the locus, whereby the allelic 
identity between the target ^nd the reference strands results in the 
formation of a nucleic acid duplex without an abnormal base-pairing and 
5 the allelic difference between the target and the reference strands results 
in the formation of a nucleic acid duplex with an abnormal base-pairing; 
b) contacting the nucleic acid duplex formed in step a) with a mutant 
DNA repair enzyme or complex thereof, wherein the mutant DNA repair 
enzyme or complex thereof has binding affinity for the abnormal base- 

10 pairing in the duplex but has attenuated catalytic activity; and c) 

detecting binding between the nucleic acid duplex and the mutant DNA 
repair enzyme or complex thereof, whereby the polymorphism in the 
locus is assessed. 

In a specific embodiment, the polymorphism to be detected is a 

1 5 variable nucleotide type polymorphism ("VNTR"). 

In another specific embodiment, the polymorphism to be detected 
is a single nucleotide polymorphism (SNP). Preferably, a polymorphism in 
a genome, e.g., a viral, bacterial, eukaryotic, mammalian or human 
genome, is detected by the present methods. More preferably, the* 

20 human genome SNPs listed in the following Table 2 can be detected by 
the present methods (see e.g., http://www.ncbi.nlm.gov/SNP). 
Table 2. Examples .of human genome polymorphisms 

FINE MAP dbSNP HANDLE | LOCAL 

25 CHROMOSOME LOCATION 

1 0.00 cR from top of Chr1 linka 

1 0.00 cR from top of Chr1 llnka 

1 0.60 cR from top of Chr1 linka 

30 1 6.20 cR from top of Chrl linka 

1 7.8 cR from top of Chrl linkag 

1 1 2.1 cR from top of Chr1 linka 

1 1 6.40 cR from top of Chrl link 

1 21 .2 cR from top of Chrl linka 

35 1 23.3 cR from top of Chrl linka 

1 27.10 cR from top of Chrl link 

1 33.30 cR from top of Chr1 link 



ASSAY ID 


SNP ID 


1946 


WIAF 


WIAF-3885 


2870 


WIAF 


WIAF-768 


1196 


WIAF 


WIAF-2083 


1861 


WIAF 


WIAF-3800 


2383 


WIAF 


WIAF-2674 


3083 


WIAF 


WIAF-984 


1921 


WIAF 


WIAF-3860 


3061 


WIAF 


WIAF-962 


2762 


WIAF 


WIAF-501 


1421 


WIAF 


WIAF-3349 


2934 


WIAF 


WIAF-833 
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CHROMOSOME 



FINE MAP 
LOCATION 



dbSNP HANDLE | LOCAL 
ASSAY ID SNP ID 



34.50 cR from top of Chrl link 3318 WIAF 

50.0 cR from top of Chrl linka 2566 WIAF 

50.40 cR from top of Chr1 link 1954 WIAF 

51 .20 cR from top of Chrl link 3248 WIAF 

54.9 cR from top of Chrl linka 3124 WIAF 

55.5 cR from top of Chrl linka 2576 WIAF 

55.80 cR from top of Chrl link 1 1 30 WIAF 

55.80 cR from top of Chrl link 1131 WIAF 

55.80 cR from top of Chrl link 2951 WIAF 

55.90 cR from top of Chrl link 670 WIAF 

57.00 cR from top of Chrl link 3255 WIAF 

59.80 cR from top of Chr1 link 2526 WIAF 

60 cM 4319 UWGC 

60.70 cR from top of Chrl link 1498 WIAF 

62.8 cR from tpp of Chrl linka 2079 WIAF 

68.5 cR from top of Chrl linka 3138 WIAF 

69.00 cR from top of Chrl link 3043 WIAF 

71 .30 cR from top of Chrl link 31 88 WIAF 

75.30 cR from top of Chrl link 3479 WIAF 

75.90 cR from top of Chrl link 1886 WIAF 

77.20 cR from top of Chrl link 1275 WIAF 

77.90 cR from top of Chrl link 677 WIAF 

78.30 cR from top of Chrl link 2876 WIAF 

78.60 cR from top of Chrl link 1 179 WIAF 

84.30 cR from top of Chrl link 1756 WIAF 

91 .5 cR from top of Chrl linka 743 WIAF 

92.60 cR from top of Chrl link 1388 WIAF 

97.8 cR from top of Chrl linka 2273 WIAF 

1 03.20 cR from top of Chrl lln 1 622 WIAF 

103.20 cR from top of Chrl lih 1626 WIAF 

106.90 cR from top of Chrl lln 1 577 WIAF 

1 1 3.3 cR from top of Chrl link 2554 WIAF 

1 1 7.4 cR from top of Chrl link 975 WIAF 
118.70cRfromtopof Chrl lln 2527 WIAF 
1 1 8.70 cR from top of Chrl lin 1 952 WIAF 
1 19.10 cR from top of Chrl lin 2032 WIAF 
120.30cRfromtopof Chrl lin 3229 WIAF 
1 29.30 cR from top of Chrl lin 1 873 WIAF 
1 29.30 cR from top of Chr1 lln 1 876 WIAF 
1 29.30 cR from top of Chrl lin 1 877 WIAF 
1 29.40 cR from top of Chrl lin 1 1 57 WIAF 
1 41 .60 cR from top of Chrl lin 1110 WIAF 
142.9 cR from top of Chrl link 2123 WIAF 

142.9 cR from top of Chr1 link 2124 WIAF 
1 46.90 cR from top of Chrl lin 1 859 WIAF 
147.90cRfromtopof Chrl lin 3552 WIAF 
147.90cRfromtopof Chrl lin 1693 WIAF 

148.10 cR from top of Chrl lin 3053 WIAF 
1 48.30 cR from top of Chrl lin 1 1 86 WIAF 
1 54.00 cR from top of Chrl lin 1 263 WIAF 



WIAF- 

WIAF' 

WIAF- 

WIAF- 

WIAF- 

WIAF- 

WIAF- 

WIAF- 

WIAF- 

WIAF- 

WIAF- 

WIAF- 

I 138 

WIAF- 

WIAF- 

WIAF- 

WIAF- 

WIAF- 

WIAF- 

WIAF- 

WIAF- 

WIAF- 

WIAF- 

WIAF- 

WIAF- 

WIAF- 

WIAF- 

WIAF- 

WIAF- 

WIAF- 

WIAF- 

WIAF- 

WIAF- 

WIAF- 

WIAF- 

WIAF- 

WIAF- 

WIAF- 

WIAF- 

WIAF- 

WIAF- 

WIAF- 

WIAF- 

WIAF- 

WIAF- 

WIAF- 

WIAF- 

WIAF- 

WIAF- 

WIAF- 



1771 

•195 

■3893 

1663 

•1025 

•206 

1577 

1578 

850 

1348 

1677 

135 

3437 

28 

1039 

944 

1504 

1934 

3825 

2162 

1443 

774 

1708 

3695 

1191 

3293 

734 

3561 

3565 

3516 

178 

1388 

136 

3891 

1590 

1630 

3812 

3815 

3816 

1642 

1543 

298 

299 

3798 

2007 

3632 

954 

2073 

2150 
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CHROMOSOME 



FINE MAP 
LOCATION 



dbSNP HANDLE | LOCAL 
ASSAY ID SNP ID 



10 



15 



20 



25 



30 



35 



40 



45 



50 



156.10 cR from top of Chrl lin 1266 WiAF 1 WIAF-2153 

156.10 cR from top of Chrl lin 1267 WIAF | WIAF-2154 

160.30 cR from top of Chr1 lin 1945 WIAF | WIAF-3884 

160.50 cR from top of Chrl lin 1369 WIAF | WIAF-3272 

161.9 cR from top of Chrl link 1077 WIAF j WIAF-2040 
162.40 cR from top of Chrl lin 1140 WIAF | WIAF-1603 
162.90 cR from top of Chrl lin 3038 WIAF j WIAF-939 

164.10 cR from top of Chrl lin 3574 WIAF | WIAF-2029 
164.10 cR from top of Chrl lin 3575 WIAF | WIAF-2030 
164.10 cR from top of Chrl lin 1357 WIAF 1 WIAF-3260 
164.60 cR from top of Chrl lin 1566 WIAF j WIAF-3505 
166.90 cR from top of Chrl lin 3466 WIAF | WIAF-1921 
168.60 cR from top of Chrl lin 1295 WIAF j WIAF-2182 
168.60 cR from top of Chrl lin 1296 WIAF | WIAF-2183 
169.40 cR from top of Chrl lin 1930 WIAF | WIAF-3869 
170.30 cR from top of Chrl lin 1641 WIAF | WIAF-3580 
170.30 cR from top of Chrl lin 1644 WIAF | WIAF-3583 
171 .5 cR from top of Chrl link 2853 WIAF j WIAF'740 
174.50 cR from top of Chr1 lin 1751 WIAF j WIAF-3690 
182.20 oR from top of Chr1 lin 1731 WIAF | WIAF-3670 
182.30 cR from top of Chr1 lin 2034 WIAF j WlAF-1595 
182.80 cR from top of Chrl lin 3437 WIAF j WIAF-1892 
183.30 cR from top of Chrl lin 1982 WIAF | WIAF-3921 
183.8 cR from top of Chrl link 3593 WIAF j WIAF-2069 
187.20 cR from top of Chr1 lin 2450 WIAF | WIAF-38 
188.30 cR from top of Chrl lin 2868 WIAF | WIAF-766 
191 .30 cR from top of Chrl lin 1521 WIAF j WIAF-3460 
192.40 cR from top of Chrl lin 1458 WIAF j WIAF-3391 
192.50 cR from top of Chrl lin 1445 WIAF | WIAF-3375 
198.30 cR from top of Chr1 lin 1360 WIAF j WIAF-3263 
198.7 cR from top of Chrl link 2224 WIAF | WIAF-653 
199.30 cR from top of Chrl lin 3393 WIAF j WlAF-1848 
200.80 cR from top of Chrl lin 1224 WIAF | WIAF-21 1 1 
201 .00 cR from top of Chrl lin 1245 WIAF | WIAF-21 32 
204.40 cR from top of Chrl lin 1235 WIAF | WIAF-21 22 
209.90 cR from top of Chrl lin 291 1 WIAF j WIAF-809 
213.0 cR from top of Chrl link 983 WIAF j WIAF-1409 
216.50 cR from top of Chrl lin 1477 WIAF j WIAF-3415 
217.60 cR from top of Chrl lin 1995 WIAF | WIAF-3934 
218.0 cR from top of Chrl link 2947 WIAF | WIAF-846 
221 .70 cR from top of Chrl lin 1 191 WIAF j WIAF-2078 
224.60 cR from top of Chrl lin 2006 WIAF j WIAF-1470 
224.70 cR from top of Chrl lin 1823 WIAF | WIAF-3762 
228.50 cR from top of Chrl iln 1 585 WIAF | WIAF-3524 
228.50 cR from top of Chr1 lin 1 590 WIAF | WIAF-3529 
231 .2 cR from top of Chrl link 3142 WIAF | WIAF-1043 
231.2 cR from top of Chrl link 3544 WIAF j WIAF-1999 
232.00 cR from top of Chr1 lin 3326 WIAF | WIAF-1779 
232,40 cR from top of Chrl Iln 3518 WIAF | WIAF-1973 
235.30 cR from top of Chrl lin 1262 WIAF j WIAF-21 49 
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236.3 cR from top of Chrl link 
246.20 cR from top of Chrl iin 
247.30 cR from top of Chrl Iin 

247.4 cR from top of Chr1 link 

247.4 cR from top of Chrl link 
248.00 cR from top of Chrl Iin 
249.00 cR from top of Chrl Iin 
249.80 cR from top of Chrl Iin 
249.80 cR from top of Chrl Iin 
250.10 cR from top of Chr1 Iin 
250.10 cR from top of Chr1 Iin 
251 .00 cR from top of Chrl Iin 

253.2 cR from top of Chrl link 
254.7 cR from top of Chr1 link 

254.7 cR from top of Chrl link 
256,10 cR from top of Chrl Iin 
258.70 cR from top of Chrl Iin 

263.8 cR from top of Chrl link 
273.20 cR from top of Chrl Iin 

281 .00 cR from top of Chr1 Iin 
282.70 cR from top of Chrl Iin 

284.3 cR from top of Chrl link 

286.6 cR from top of Chrl link 
292,70 cR from top of Chrl Iin 

369.7 cR from top of Chrl link 

454.8 cR from top of Chrl link 
458.7 cR from top of Chrl link 
477-3 cR from top of Chrl link 

557.1 cR from top of Chrl link 

573.5 cR from top of Chrl link 

629.9 cR from top of Chrl link 
639.0 cR from top of Chrl link 

646.6 cR from top of Chrl link 

674.3 cR from top of Chrl link 

675.4 cR from top of Chrl link 

676.5 cR from top of Chrl link 
676.5 cR from top of Chrl link 
680.0 cR from top of Chrl link 
80.0 cR from top of Chrl link 



680.0 cR from top of Chrl I 
680.0 cR from top of Chrl I 
684.2 cR from top of Chrl I 
684.2 cR from top of Chrl I 
692.5 cR from top of Chrl li 
695.0 cR from top of Chr1 1 
702.0 cR from top of Chr1 I 
732.4 cR from top of Chrl I 
749.9 cR from top of Chrl li 
759.2 cR from top of Chrl li 
769.0 cR from top of Chrl li 



nk 
nk 
nk 
nk 
ink 
nk 
nk 
nk 
nk 
nk 
nk 



2877 
1491 
1747 
2654 
2655 
1211 
1508 
3112 
3113 

704 
1113 
3559 
3399 
2643 
2966 
1102 
1185 
3295 
1236 
3224 
3348 
3388 
2075 
1630 
2941 
2910 
2462 
3922 
2381 
2741 
3592 

772 
1078 
3856 
2482 
2555 
3501 
4585 
4558 
4559 

759 
3067 
3068 
2715 
2959 
2623 
2223 
2250 
2586 
2810 



WIAF 1 
WIAF 1 
WIAF 1 
WIAF 1 
WIAF I 
WIAF I 
WIAF i 
WIAF I 
WIAF I 
WIAF I 
WIAF 1 
WIAF I 
WIAF I 
WIAF I 
WIAF I 
WIAF I 
WIAF I 
WIAF I 
WIAF I 
WIAF I 
WIAF I 
WIAF I 
WIAF t 
WIAF I 
WIAF I 
WIAF I 
WIAF I 
WIAF I 
WIAF I 
WIAF 
WIAF I 
WIAF I 
WIAF I 
WIAF I 
WIAF I 
WIAF I 
WIAF I 
HU-CHI 
HU-CHI 
HU-CHl 
WIAF I 
WIAF I 
WIAF I 
WIAF I 
WIAF I 
WIAF I 
WIAF I 
WIAF I 
WIAF I 
WIAF I 



WIAF-775 
WIAF-3430 
WIAF-3686 
WIAF-328 
WIAF-329 
WIAF-2098 
WIAF-3447 
WIAF-1013 
WIAF-1014 
WIAF-1 344 
WIAF-1 548 
WIAF-2014 
WIAF-1 854 
WIAF-312 
WIAF-866 
WIAF-1 521 
WIAF-2072 
WIAF-1 748 
WIAF-2123 
WIAF-1 61 6 
WIAF-1 801 
WIAF-1 842 
WIAF-1 1 
WIAF-3569 
WIAF-840 
WIAF-808 
WIAF-53 
WIAF-4010 
WIAF-2667 
I WIAF-455 
WIAF-2068 
WIAF- 1403 
WIAF-2044 
WfAF-2644 
WIAF-79 
WIAF-1 79 
WIAF-1 956 
NA I 1-1328 
NA I 1-1328- 
NA I 1-1328- 
WIAF-1 328 
WIAF-968 
WIAF-969 
WIAF-413 
WIAF-858 
WIAF-282 
WIAF-652 
WIAF-696 
WIAF-221 
WIAF.590 
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769.1 cR 
770.3 cR 

781.7 cR 

783.2 cR 

817.5 cR 

819.6 cR 
820.1 cR 

820.1 cR 

823.3 cR 
823.3 cR 
'838.6 cR 

873.2 cR 
873.2 cR 
875.1 cR 

883.1 cR 

884.8 cR 

889.8 cR 

890.2 cR 

890.3 cR 

910.7 cR 
917.7 cR 

943.9 cR 

947.6 cR 

951.7 cR 
959.3 cR 
959.3 cR 

961.2 cR 

961.3 cR 

961.3 cR 

962.8 cR 
969.0 cR 

980.4 cR 

980.4 cR 

996.9 cR 

998.5 cR 



from 
from 
from 
from 
from 
from 
from 
from 
from 
from 
from 
from 
from 
from 
from 
from 
from 
from 
from 
from 
from 
from 
from 
from 
from 
from 
from 
from 
from 
from 
from 
from 
from 
from 
from 



top o 
top o 
top o 
top o 
top o 
top o 
top o 
top o 
top o 
top o 
top 0 
top o 
top o 
top o 
top 0 
top o 
top o 
top o 
top 0 
top 0 
top o 
top o 
top o 
top o 
top 0 
top 0 
top o 
top o 
top o 
top o 
top 0 



top o 
top o 



►f Chr 


1 li 


►f Chr 


'1 li 


if Chr 


■1 r 


»f Chr 


■1 li 


»f Chr 


•1 li 


if Chr 


•1 1 


if Chr 


•1 1 


if Chr 


'1 1 


>f Chr 


'1 1 


)i Chr 


'1 1 


If Chr 


'1 1 


If Chr 


•1 1 


if Chr 


'1 1 


)f Chr 


'1 1 


>f Chi 


'1 1 


if Chi 


'1 I 


If Chr 


•1 Ii 


If Chr 


'1 li 


if Chr 


•1 1 


if Chr 


■1 1 


if Chr 


•1 1 


if Chr 


'1 li 


If Chr 


'1 h 


if Chr 


-1 1 


If Chr 


'1 1 


If Chr 


•1 1 


If Chr 


•1 1 


if Chr 


'1 1 


If Chi 


•1 I 


If Chi 


•1 1 


if Chr 


'1 1 


>f Chi 


'1 1 


>f Chi 


'1 1 


if Chr 


^1 1 


If Chr 


'1 1 



dbSNP HANDLE | LOCAL 
ASSAY ID SNP ID 

link 769 WIAF 1 WIAF-1389 
link 3448 WIAF | WIAF-1903 
link 3004 WIAF j WIAF-904 
link 2086 WIAF | WIAF-95 
link 976 WIAF | WIAF-1390 
link 3395 WIAF | WIAF-1850 
link 895 WIAF | WIAF-1143 
link 1006 WIAF | WIAF-4029 
link 2088 WIAF | WIAF-102 
link 2089 WIAF | WIAF-IOS 
link 2232 WIAF | WIAF-665 
link 2618 WIAF j WIAF-269 
link 2619 WIAF | WIAF-270 
link 3850 WIAF | WIAF-2636 
link 2540 WIAF | WlAF-154 
link 2867 WIAF | WIAF-765 
link 3051 WIAF | WIAF:952 
link 3116 WIAF | WIAF-i017 
link 3841 WIAF | WIAF-2617 
link 2983 WIAF | WIAF-883 
link 3042 WIAF | WIAF-943 
link 2525 WIAF | WIAF-134 
link 2885 WIAF | WIAF-783 
link 2935 WIAF | WIAF-834 
link 3283 WIAF | WIAF-1736 
link 2424 WIAF | WIAF-4 
link 2570 WIAF j WIAF-200 
link 2782 WIAF | WIAF-531 
link 2479 WIAF j WIAF-75 
link 2637 WIAF | WIAF-297 
link 3114 WIAF j WIAF-1015 
link 2976 WIAF | WIAF-876 
link 2977 WIAF j WIAF-877 
link ' 2897 WIAF j WIAF-795 
link 2541 WIAF | WIAF-155 
4221 MARSHFIELD | MID- 
4222 MARSHFIELD | MID- 
3996 SHGC/AFFYMETRIX 
4004 SHGC/AFFYMETRIX 
4155 SHGC/AFFYMETRIX 
4082 SHGC/AFFYMETRIX 
4098 SHGC/AFFYMETRIX 
4037 SHGC/AFFYMETRIX 
4041 SHGC/AFFYMETRIX 
4043 SHGC/AFFYMETRIX 
4049 SHGC/AFFYMETRIX 
3117 WIAF I WIAF-1018 

3203 WIAF 1 WIAF-1 546 

3204 WIAF I WIAF-1 547 
3222 WIAF j WIAF-1 610 



13 
14 

I SNP- 
I SNP- 
I SNP. 
I SNP- 
I SNP- 
I SNP- 
I SNP- 
I SNP- 
I SNP- 



SHGC- 
SHGC- 
SHGC- 
SHGC- 
SHGC- 
SHGC- 
SHGC- 
SHGC- 
SHGC- 



10870 

12999 

14385 

16847 

18912 

8109 

8491 

8995 

9374 



-108- 



wo 01/62968 



PCT/USOl/00452 



CHROMOSOME 



FINE MAP 
LQCATiON 



dbSNP HANDLE | LOCAL 
ASSAY ID SNP ID 



10 
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2 0.00 cR from top of Chr2 linka 

2 6 cM 

2 6.00 cR from top of Chr2 linka 

2 6.00 cR from top of Chr2 linka 

2 9.40 cR from top of Chr2 linka 

2 1 2, 1 0 cR from top of Chr2 link 

2 1 2. 1 0 cR from top of Chr2 link 

2 24.50 cR from top of Chr2 link 

2 32.90 cR from top of Chr2 link 

2 36.60 cR from top of Chr2 link 

2 40.20 cR from top of Chr2 link 

2 41 .5 cR from top of Chr2 linka 

2 44.40 cR from top of Chr2 link 

2 44.6 cR from top of Chr2 linka 

2 46.00 cR from top of Chr2 link 

2 46.1 cR from top of Chr2 linka 

2 47.90 cR from top of Chr2 link 

2 47.90 cR from top of Chr2 link 

2 50.30 cR from top of Chr2 link 

2 50.70 cR from top of Chr2 link 

2 51 .1 0 cR from top of Chr2 link 

2 51 .40 cR from top of Chr2 link 

2 51 .40 cR from top of .Chr2 link 

2 54.7 cR from top of Chr2 linka 

2 55.00 cR from top of Chr2 link 

2 64.90 cR from top of Chr2 link 

2 64.90 cR from top of Chr2 link 

2 66.80 cR from top of Chr2 link 

2 69.00 cR from top of Chr2 link 

2 70.30 cR from top of Chr2 link 

2 70.30 cR from top of Chr2 link 

2 70.60 cR from top of Chr2 link 

2 71 .70 cR from top of Chr2 link 

2 76.60 cR from top of Chr2 link 

2 79.70 cR from top of Chr2 link 

2 82.20 cR from top of Chr2 link 

2 84.8 cR from top of Chr2 linka 

2 87.1 0 cR from top of Chr2 link 

2 89.70 cR from top of Chr2 link 



3315 
3432 
3515 
3578 
1519 
3887 
3914 
3915 
2955 
2969 

2010 
4326 

706 
1446 
2676 
3383 
3384 
1276 
1334 
1201 
1203 
2517 

698 
2750 
1228 
3385 
3236 
3237 
1420 
1129 
2925 
3223 
1311 
3033 
1975 
3345 
1529 
2014 
1177 
1920 
1922 
2023 
1347 
1104 
1257 
1694 
2850 
1599 
1280 



WIAF 
WIAF 
WIAF 
WIAF 
WIAF 
WIAF 
WIAF 
WIAF 
WIAF 
WIAF 

WIAF 

UWGC 

WIAF 

WIAF 

WIAF 

WIAF 

WIAF 

WIAF 

WIAF 

WIAF 

WIAF 

WIAF 

WIAF 

WIAF 

WIAF 

WIAF 

WIAF 

WIAF 

WIAF 

WIAF 

WIAF 

WIAF 

WIAF 

WIAF 

WIAF 

WIAF 

WIAF 

WIAF 

WIAF 

WIAF 

WIAF 

WIAF 

WIAF 

WIAF 

WIAF 

WIAF 

WIAF 

WIAF 

WIAF 



WIAF- 
WIAF- 
WIAF- 
WIAF- 
WIAF- 
WIAF- 
WIAF- 
WIAF- 
WIAF- 
WIAF- 



1768 

1887 

1970 

2033 

3458 

3948 

3998 

4000 

854 

869 



WIAF-1492 
I 145 

WIAF-1363 

WIAF-3376 

WIAF-358 

WIAF-1836 

WIAF-1837 

WIAF-2163 

WIAF-2224 

WIAF-2088 

WIAF-2090 

WIAF-125 

WIAF-1268 

WIAF-469 

WIAF-2115 

WIAF-1839 

WIAF.1645 

WIAF- 1646 

WIAF-3348 

WIAF-1573 

WIAF-824 

WIAF-1612 

WIAF-2200 

WIAF-933 

WIAF-3914 

WIAF-1798 

WIAF-3468 

WIAF-1 508 

WIAF-1705 

WIAF-3859 

WIAF-3861 

WIAF-1 562 

WIAF-3250 

WIAF-1 528 

WIAF-2144 

WIAF-3633 

WIAF-714 

WIAF-3538 

WIAF-2167 
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2 89.70 cR from top of Chr2 link 1 594 WIAF 

2 90.10cRfromtopof Chr2link 692 WIAF 

2 91.60 cR from top of Chf'2 link 1412 WIAF 

2 92.2 cR from top of Chr2 linka 3103 WIAF 

5 2 92.20 cR from top of. Chr2 link 1423 WIAF 

2 93.80 cR from top of Chr2 link 1243 WIAF 

2 96.00 cR from top of Chr2 link 1162 WIAF 

2 lOe.lOcRfromtopof Chr2Iin 3324 WIAF 

2 106.10 cR from top of Chr2lin 1955 WIAF 

10 2 1 1 0.00 cR from top of Chr2lin 1684 WIAF 

2 1 1 2.40 cR from top of Chr2 lin 1611 WIAF 

2 1 1 2.40 cR from top of Chr2 lin 1613- WIAF 

2 1 1 5.30 cR from top of Chr2 lin 1 286 WIAF 

2 1 1 5.30 cR from top of Chr2 lin 1 287 WIAF 

15 2 1 17.60 cR from top of Chr2 lin 3509 WIAF 

2 117.60 cR from top of Chr2 lin 3510 WIAF 

2 1 1 8.60 cR from top of Chr2 lin 1 327 WIAF 

2 1 1 8.80 cR from top of Chr2 lin 3458 WIAF 

2 118.80 oR from top of Chr2 lin 3459 WIAF 

20 2 118.80 cR from top of Chr2 lin 3460 WIAF 

2 1 1 9.20 cR from top of Chr2 lin 201 7 WIAF 

2 1 1 9.30 cR from top of Chr2 lin 1 653 WIAF 

2 122.40 cR from top of Chr2 lin 702 WIAF 

2 1 23. 1 0 cR from top of Chr2 lin 1 503 WIAF 

25 2 1 23. 1 0 cR from top of Chr2 lin 1 504 WIAF 

2 123.4 cR from top of Chr2 link 2638 WIAF 

2 124.50 cRfromtopof Chr2lin 3014 WIAF 

2 134.30 cR from top of Chr2 lin 1091 WIAF 

2 134.30 cR from top of Chr2 lin 1915 WIAF 

30 2 135.80 cR from top of Chr2 lin 1724 WIAF 

2 149.50 cR from top of Chr2 lin 1617 WIAF 

2 152.6 cR from top of Chr2 link 2284 WIAF 

2 1 58.40 cR from top of Chr2 lin 3208 WIAF 

2 1 58.40 cR from top of Chr2 lin 3209 WIAF 

35 2 1 59.40 cR from top of Chr2 lin 1824 WIAF 

2 1 62.90 cR from top of Chr2 lin 1699 WIAF 

2 164,60cRfromtopofChr2 lin 1947 WIAF 

2 1 66.4 cR from top of Chr2 link 3054 WIAF 

2 166.50 cR from top of Chr2 lin 3173 WIAF 

40 2 169.10 cR from top of Chr2 lin 1455 WIAF 

2 180.30 cR from top of Chr2 lin 1368 WIAF 

2 188.20 cR from top of Chr2 lin 1728 WIAF 

2 188.40 cR from top of Chr2 lin 3431 WIAF 

2 188.60 cR from top of Chr2 lin 1206 WIAF 

45 2 188.70 cR from top of Chr2 lin 1356 WIAF 

2 190.80 cR from top of Chr2 lin 1677 WIAF 

2 191.20 cR from top of Chr2 lin 2025 WIAF 

2 191.20 cR from top of Chr2 lin 1164 WIAF 

2 1 91 .40 cR from top of Chr2 lin 1 509 WIAF 

50 2 191.50 cR from top of Chr2 lin 2636 WIAF 



WIAF-3533 

WIAF-1226 

WIAF-3333 

WIAF-1004 

WIAF-3351 

WIAF-2130 

WIAF-1665 

WIAF-1777 

WIAF-3894 

WIAF-3623 

WIAF-3550 

WIAF-3552 

WIAF-2173 

WIAF-2174 

WIAF-1964 

WIAF-1965 

WIAF-2217 

WIAF-1913 

WIAF-1914 

WIAF-1915 

WiAF-1518 

WIAF-3592 

WIAF-1311 

WIAF-3442 

WIAF-3443 

WIAF-304 

WIAF-914 

WIAF-1467 

WIAF-3854 

WIAF-3663 

WIAF-3556 

WIAF-757 

WIAF-1559 

WIAF- 1560 

WIAF-3763 

WIAF-3638 

WIAF-3886 

WIAF-955 

W/AF-1487 

WIAF-3388 

WIAF-.3271 

WIAF-3667 

WIAF-1 886 

WIAF-2093 

WIAF-3259 

WIAF-3616 

WIAF-1 570 

WIAF-1 675 

WIAF-3448 

WIAF-296 
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2 1 92.9 cR from top of Chr2 link 2454 WIAF 

2 192.9 cR from top of Chr2 link 2455 WIAF 

2 195.10 cR from top of Chr2 lin 1193 WIAF 

2 200.30 cR from top of Chr2 lin 1248 WIAF 

5 2 200.40 cR from top of Chr2 lin 1619 WIAF 

2 201 .5 cR from top of Chr2 link 2968 WIAF 

2 202.7 cR from top of Chr2 link 2503 WIAF 

2 208.30 cR from top of Chr2 lin 1676 WIAF 

2 208.30 cR from top of Chr2 lin 1 678 WIAF 

10 2 213.00 cR from top of Chr2 lin 3813 WIAF 

2 214.50 cR from top of Chr2 lin 3487 WIAF 

2 219.30 cR from top of Chr2 lin 1288 WIAF 

2 219.30 cR from top of Chr2 lin 1289 WIAF 

2 220.10 cR from top of Chr2 lin 1736 WIAF 

15 2 221 .1 cR from top of Chr2 link 909 WIAF 

2 221.1 cR from top of Chr2 link 1046 WIAF 

2 221.50 cR from top of Chr2 lin 3310 WIAF 

2 222.6 cR from top of Chr2 link 3321 WIAF 

2 223.40 cR from top of Chr2 lin 351 2 WIAF 

20 2 229.80 cR from top of Chr2 lin 1510 WIAF 

2 229.80 cR from top of Chr2 lin 1511 WIAF 

2 234.50 cR from top of Chr2 lin 1523 WIAF 

2 236.10 cR from top of Chr2 lin- 2020 WIAF 

2 236.10 oR from top of Chr2 lin 1844 WIAF 

25 2 236.10 cR from top of Chr2 lin 1846 WIAF 

2 240.20 cR from top of Chr2 lin 1384 WIAF 

2 242.40 cR from top of Chr2 lin 1663 WIAF 

2 246.10 cR from top of Chr2 lin 1303 WIAF 

2 247.10 cR from top of Chr2 lin 713 WIAF 

30 2 247.20 cR from top of Chr2 lin 1502 WIAF 

2 253.00 cR from top of Chr2 lin 1309 WIAF 

2 269.50 cR from top of Chr2 lin 1750 WIAF 

2 272.50 cR from top of Chr2 lin 1534 WIAF 

2 272.50 oR from top of Chr2 lin 1702 WIAF 

35 2 272.60 cR from top of Chr2 lin 2875 WIAF 

2 278.8 cR from top of Chr2 link 3825 WIAF 

2 285.6 cR from top of Chr2 link 3539 WIAF 

2 285.7 cR from top of Chr2 link 3849 WIAF 

. 2 287.2 cR from top of Chr2 link 3587 WIAF 

40 2 287.2 cR from top of Chr2 link 1071 WIAF 

2 290.4 cR from top of Chr2 link 2697 WIAF 

2 293.7 cR from top of Chr2 link 2154 WIAF 

2 293.7 cR from top of Chr2 link 2155 WIAF 

2 300.1 cR frorh top of Chr2 link 2923 WIAF 

45 2 31 8.2 cR from top of Chr2 link 966 WIAF 

2 325.6 oR from top of Chr2 link 4081 

2 341 .6 cR from top of Chr2 link 2281 WIAF 

2 375.3 cR from top of Chr2 link 2863 WIAF 

2 742.6 cR from top of Chr2 link 2213 WIAF 

50 2 750.0 cR from top of Chr2 link 2639 WIAF 



WIAF- 
WIAF' 
WIAF- 
WIAF- 
WIAF- 
WIAF- 
WIAF- 
WIAF- 
WIAF- 
WIAF- 
WIAF- 
WIAF- 
WIAF- 
WIAF- 
WIAF- 
WIAF- 
WIAF- 
WIAF- 
WIAF- 
WIAF- 
WIAF- 
WIAF- 
WIAF" 
WIAF- 
WIAF- 
WIAF- 
WIAF- 
WIAF- 
WIAF- 
WIAF- 
WIAF- 
WIAF- 
WIAF- 
WIAF- 
WIAF- 
WIAF- 
WIAF- 
WIAF- 
WIAF- 
WIAF- 
WIAF- 
WIAF- 
WIAF- 
WIAF- 
WIAF- 



45, 
46 

2080 

2135 

3558 

868 

107 

3615 

3617 

2565 

1942 

2175 

2176 

3675 

1184 

4141 

1763 

1774 

1967 

3449 

3450 

3462 

1526 

3783 

3785 

3289 

3602 

2192 

1451 

3441 

2198 

3689 

3473 

3641 

773 

2590 

1994 

2635 

2052 

4203 

383 

486 

487 

822 

1371 



SHGC/AFFYMETRIX | SNP-SHGC-1 6802 



WIAF- 
WIAF- 
WIAF- 
WIAF- 



751 
760 
635 
305 
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10 



15 



20 



25 



30 



35 



40 



45 



50 



2 
2 
2 
2 
2 
2 
2 
2 
2 
2 
2 
2 
2 
2 
2 
2 
2 
2 
2 
2 
2 
2 
2 
2 
2 
2 
2 
2 
2 
2 
2 
2 
2 
2 
2 
2 
2 
2 
2 
2 
2 
2 
2 
2 
2 
2 
2 
2 
2 
2 



750.1 cR from top o 
758.7 cR from top o 
758.7 cR from top o 
780.9 cR from top o 
783.9 cR from top o 
796.0 cR from top o 
824.7 cR from top o 

838.5 cR from top o 
854.9 cR from top o 

881.0 cR from top o 

900.2 cR from top o 

902.1 cR from top o 

910.6 cR from top o 
910.6 cR from top o 

910.6 cR from top o 

915.7 cR from top o 
920,4 cR from top o 
931.1 cR from top o 

952.3 cR from top o 

956.7 cR from top o 

961.8 cR from top o 
981.1 cR from top o 

986.9 cR from top o 
986.9 cR from top o 
1009.4 cR from top 
1026.1 cR from top 
1026.1 cR from top 
1074.0 cR from top 
1089.0 cR from top 
1092.0 cR from top 
1 092.0 cR from top 
1104.9 cR from top 



Chr2 I 
Chr2 I 
Chr2 I 
Chr2 I 
Chr2 i 
Chr2 I 
Chr2 1 
Chr2 I 
Chr2 \ 
Chr2l 
Chr2 1 
Chr2l 
Chr2I 
Chr2l 
Chr2 li 
Chr2 I 
Chr2l 
Chr2l 
Chr2 I 
Chr2 I 
Chr2 I 
Chr2 li 
Chr2 I 
Chr2 I 
of Chr2 lin 
Qf Chr2 lin 
of Chr2 lin 
of Chr2 lin 
of Chr2 lin 
of Chr2 lin 
of Chr2 lin 
of Chr2 lin 



ink 
ink 
nk 
ink 
ink 
nk 
nk 
nk 
ink 
ink 
ink 
ink 
ink 
ink 
ink 
ink 
ink 
ink 
ink 
ink 
nk 
ink 
ink 
nk 



2954 WIAF 1 WIAF-853 

893 WIAF 1 WIAF-1140 

1059 WIAF I WIAF-4175 

2253 WIAF I WIAF-701 

2673 WIAF I WIAF-353 

2493 WIAF I WIAF-91 

2472 WIAF I WIAF-66 

2194 WIAF I WIAF-594 

2209 WIAF I WIAF-629 

2878 WIAF I WIAF-776 

3858 WIAF I WIAF-2647 

2661 WIAF I WIAF-337 

2961 WIAF I WIAF-860 

2962 WIAF I WIAF-861 

2963 WIAF I WIAF-862 
2500 WIAF I WIAF-99 

726 WIAF I WIAF-1066 

2104 WIAF I WIAF-177 

2516 WIAF I WIAF-124 

2629 WIAF I WIAF-289 

3109 WIAF I VVIAF-1010 

2470 WIAF I WIAF-64 

2125 WIAF I WIAF-300 

2126 WIAF I WIAF-301 
2978 WIAF I WIAF-878 

3871 WIAF I WIAF-2670 

3872 WIAF I WIAF-2671 
2738 WIAF I WIAF-450 
2052 WIAF i WIAF-1700 

3474 WIAF I WiAF-1929 

3475 WIAF I WIAF-1930 
3309 WIAF I WIAF-1762 
4223 MARSHFIELD | MID- 
4224 MARSHFIELD j MID- 
3962 SHGC/AFFYMETRIX 
4069 SHGC/AFFYMETRIX 

3967 SHGC/AFFYMETRIX 

3968 SHGC/AFFYMETRIX 
4164 SHGC/AFFYMETRIX 
4074 SHGC/AFFYMETRIX 
4087 SHGC/AFFYMETRIX 
4016 SHGC/AFFYMETRIX 
4040 SHGC/AFFYMETRIX 
4044 SHGC/AFFYMETRIX 
4048 SHGC/AFFYMETRIX 
3122 WIAF I WIAF-1023 
3130 WIAF j WIAF-1031 
3159 WIAF j WIAF-1458 
1218 WIAF I WIAF-2105 
1231 WIAF I WIAF-2118 



15 
16 

I SNP- 
I SNP- 
I SNP- 
1 SNP- 
I SNP- 
I SNP- 
I SNP- 
I SNP- 
I SNP- 
I SNP- 
I SNP- 



SHGC- 
SHGC- 
SHGC- 
SHGC- 
SHGC- 
SHGC- 
SHGC- 
SHGC- 
SHGC- 
SHGC- 
SHGC- 



11130 

13615 

13867 

13934 

15247 

15661 

17089 

3987 . 

8478 

9017 

9366 
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10 



15 



20 



25 



30 



35 



40 



45 



50 



2 
2 
2 
2 
2 
2 
2 
2 
2 
2 
2 
2 
2 
2 

3 
3 
3 
3 
3 
3 
3 
3 
3 
3 
3 
3 
3 
3 
3 
3 
3 
3 
3 
3 
3 
3 
3 
3 
3 
3 
3 
3 
3 
3 
3 
3 
3 
3 
3 



12.90 cR from top of Chr3 link 
12.90 cR from top of Chr3 link 
14.5 cR from top of Chr3 links 
18.4 cR from top of Chr3 linka 
18.4 cR from top of Chr3 llnka 
19.3 cR from top of Chr3 linka 
33.50 cR from top of Chr3 link 
33.50 cR from top of Chr3 link 
36.50 cR from top of.Chr3 link 
36.90 cR from top of Chr3 link 
37.90 cR from top of Chr3 link 
43.20 cR from top of Chr3 link 
44.1 cR from top of Chr3 linka 
45.30 cR from top of Chr3 link 
46.90 cR from top of Chr3 link 
49.00 cR from top of Chr3 link 
49.00 cR from top of Chr3 link 

51.7 cR from top of Chr3 linka 
54.9 cR from top of Ghr3 linka 
55.0 cR from top of Chr3 linka 
55.40 cR from top of Chr3 link 
55.60 cR from top of Chr3 link 

56.8 cR from top of Ciir3 linka 
56.8 cR from top of Chr3 linka 
57.80 cR from top of Chr3 link 
57.80 cR from top of Chr3 link 
57.80 cR from top of Chr3 link 

58.00 cR from top of Chr3 link 
66.40 cR from top of Chr3 link 
66.80 cR from top of Chr3 link 
66.80 cR from top of Chr3 link 
67,20 cR from top of .Chr3 link 
67.50 cR from top of Chr3 link 
67.50 cR from top of Chr3 link 

72.1 cR from top of Chr3 linka 



1253 


WIAF 


WIAF-2140 


1254 


WIAF 


WIAF-2141 


3672 


WIAF 


WIAF-2400 


3683 


WIAF 


WIAF-241 1 


3705 


WIAF 


WIAF-2433 


3781 


WIAF 


WIAF-2509 


3782 


WIAF 


WIAF-2510 


2447 


WIAF 


WIAF-35 


2448 


WIAF 


WIAF-36 


2449 


WIAF 


WIAF-37 


2480 


WIAF 


WIAF-76 


3080 


WIAF 


WIAF-981 


3097 


WIAF 


WIAF-998 


3098 


WIAF 


WIAF-999 


1522 


WIAF 


WIAF-3461 


1524 


WIAF 


WIAF-3463 


2098 


WIAF 


WIAF-144 


3339 


WIAF 


WIAF-1792 


3340 


WIAF 


WIAF-1793 


2244 


WIAF 


WIAF-685 


3811 


WIAF 


WIAF-2563 


1926 


WIAF 


WIAF-3865 


2886 


WIAF 


WIAF-784 


1893 


WIAF 


WIAF-3832 


1142 


WIAF 


WIAF-1605 


1494 


WIAF 


WIAF-3433 


2939 


WIAF 


WIAF-838 


3491 


WIAF 


WIAF-1946 


3312 


WIAF 


WIAF-1765 


1449 


WIAF 


WIAF-3380 


1450 


WIAF 


WIAF-3382 


2191 


WIAF 


WIAF-587 


2456 


WIAF 


WIAF-47 


3863 


WIAF 


WiAF-2656 


3471 


WIAF 


WIAF-1926 


3336 


WIAF 


WIAF-1789 


2508 


WIAF 


WIAF-114 


2509 


WIAF 


WIAF-115 


2037 


WIAF 


WIAF-1617 


1825 


WIAF 


WIAF-3764 


2707 


WIAF 


WIAF-398 


2984 


WIAF 


W)AF-884 


1308 


WIAF 


WIAF-2197 


3225 


WIAF 


WIAF-1 624 


3483 


WIAF 


WIAF-1938 


683 


WIAF 


WIAF-1 074 


3245 


WIAF 


WIAF-1 655 


1602 


WIAF 


WIAF-3541 


3308 


WIAF 


WIAF-1 761 
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3 72.30 cR from top of Chr3 link 3193 WIAF 

3 72.30 cR from top of Chr3 link 3194 WIAF 

3 72.40 cR from top of Chr3 link 826 WIAF 

3 72.60 cR from top of Chr3 link 3410 WIAF 

5 3 72.8 cR from top of Chr3 linka 2622 WIAF 

3 73.3 cR from top of Chr3 linka 3868 WIAF 

3 74.00 cR from top of Chr3 link 1 595 WIAF 

3 77.40 cR from top of Chr3 link 1690 WIAF 

3 aOcM 4314 UWGC 

10 3 80.80 cR from top of Chr3 link 3378 WIAF 

3 92.80 cR from top of Chr3 link 1452 WIAF 

3 96.60 cR from top of Chr3 link 1770 WIAF 

3 1 1 1 .00 cR from top of Chr3 lln 3341 WIAF 

3 1 1 1.10 cR from top of Chr3lin 3189 WIAF 

15 3 1 1 1 , 1 0 cR from top of Chr3 lin 31 90 ' WIAF 

3 1 1 1 .40 cR from top of Chr3 lin 1 956 WIAF 

3 1 22.30 cR from top of Chr3 lin 1 549 WIAF 

3 1 24.00 cR from top of Chr3 lin 1182 WIAF 

3 1 26.3 cR from top of Chr3 link 2854 WIAF 

20 3 1 26.8 cR from top of Chr3 link 31 23 WIAF 

3 126.90 cR from top of Chr3 lin 1454 WIAF 

3 129.30 cR from top of Chr3 lin 1395 WIAF 

3 131.30 cR from top of Chr3 lin 1923 WIAF 

3 1 34.60 cR from top of Chr3 lin 1 259 WIAF 

25 3 1 34.9 cR from top of Chr3 link 917 WIAF 

3 1 34.9 cR from top of Chr3 link 918 WIAF 

3 1 34.9 cR from top of Chr3 link 91 9 WIAF 

3 1 36.00 cR from top of Chr3 lin 1931 WIAF 

3 138.00 cR from top of Chr3 lin 3228 WIAF 

30 3 138.30 cR from top of Chr3 lin 1963 WIAF 

3 138.40 cR from top of Chr3 lin 1725 WIAF 

3 140.70 cR from top of Chr3 lin 1092 WIAF 

3 141.0 cR from top of Chr3 link 3147 WIAF 

3 141.20 cR from top of Chr3 lin 1970 WIAF 

35 3 142.20 cR from top of Chr3 lin 1229 WIAF 

3 142.40 cR from top of Chr3 lin 1187 WIAF 

3 143.80 cR from top of Chr3 lin 3263 WIAF 

3 143.80 cR from top of Chr3 lin 3264 WIAF 

3 1 43.90 cR from top of Chr3 lin 1 1 95 WIAF 

40 3 144.20 cR from top of Chr3 lin 1158 WIAF 

3 144.70 cR from top of Chr3 lin 1722 WIAF 

3 147.8 cR from top of Chr3 link 2572 WIAF 

3 151.20 cR from top of Chr3 lin 1152 WIAF 

3 151.20 cR from top of Chr3 lin 1153 WIAF 

45 . 3 1 53.80 cR from top of Chr3 lin 1 890 WIAF 

3 1 56.30 cR from top of Chr3 lin 1 71 6 WIAF 

3 1 56.60 cR from top of Chr3 lin 1 734 WIAF 

3 164.20cRfromtopofChr3 lin 3296 WIAF 

3 1 64.20 cR from top of Chr3 lin 1629 WIAF 

50 3 1 66.0 cR from top of Chr3 link 2898 WIAF 



WIAF- 

WIAF 

WIAF- 

WIAF- 

WIAF- 

WIAF- 

WIAF- 

WIAF- 

I 133 

WIAF- 

WIAF- 

WIAF- 

WIAF- 

WIAF. 

WIAF- 

WIAF- 

WIAF- 

WIAF- 

WIAF- 

WIAF- 

WIAF- 

WIAF- 

WIAF- 

WIAF- 

WIAF- 

WIAF- 

WIAF- 

WIAF- 

WIAF- 

WIAF- 

WIAF- 

WIAF- 

WIAF- 

WIAF- 

WIAF- 

WIAF- 

WIAF- 

WIAF- 

WIAF- 

WIAF- 

WIAF- 

WIAF- 

WIAF- 

WIAF- 

WIAF- 

WIAF- 

WIAF- 

WIAF- 

WIAF- 

WIAF- 



•1522 
1523 
■1489 
1865 
-281 
2663 
•3534 
3629 

1831 

3385 

3709 

1794 

1512 

1513 

3895 

3488 

1714 

741 

1024 

3387 

3300 

3862 

2146 

1207 

1208 

1209 

3870 

1629 

3902 

3664 

1473 

1048 

3909 

2116 

2074 

1702 

1703 

2082 

1656 

3661 

202 

1636 

1637 

3829 

3655 

3673 

1749 

3568 

796 
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10 



15 



20 



25 



30 
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50 



3 
3 
3 
3 
3 
3 
3 
3 
3 
3 
3 
3 
3 
3 
3 
3 
3 
3 
3 
3 
3 
3 
3 
3 
3 
3 
3 
3 
3 
3 
3 
3 
3 
3 
3 
3 
3 
3 
3 
3 
3 
3 
3 
3 
3 
3 
3 
3 
3 
3- 



170 
171 
173 
173 
186 
186 
187 
187 
189 
195 
228 
228 
228 
228 
228 
233 
233 
233 
233 
233 
233 
239 
240 
240 
269 
269 
463 
477 
477 
477 
533 
534 
546 
552 
569 
569 
604 
611 
616 
640 
640 
672 
672 
680 
680 
690 
718 
718 
718 
775 



20 cR from top of Chr3 lin 
7 cR from top of Chr3 link 
.30 cR from top of Chr3 lin 
50 cR from top of Chr3 lin 
60 cR from top of Chr3 lin 
J cR from top of Chr3 link 
.60 cR from top of Chr3 lin 
.9 cR from top of Chr3 link 
.80 cR from top of Chr3 lin 
.7 cR from top of Chr3 link 
.30 cR from top of Chr3 lin 
.30 cR from top of Chr3 lin 
A cR from top of Chr3 link 
.4 cR from top of Chr3 link 
.60 cR from top of Chr3 lin 
.00 cR from top of Chr3 lin 
.00 cR from top of Chr3 lin 
;00 cR from top of Chr3 lin 
.00 cR from top of Chr3 lin 
.00 cR from top of Chr3 lin 



8 cR from top o 

5 cR from top o 
4 cR from top o 
4 cR from top o 

6 cR from top o 
6 cR from top o 

3 cR from top o 
2 cR from top o 
2 cR from top o 
2 cR from top o 
1 cR from top o 

4 cR from top o 

0 cR from top o 

8 cR from top o 
6 cR from top o 

6 cR from top o 

7 cR from top o 

1 cR from top o 

0 cR from top o 
6 cR from top o 
6 cR from top o 

1 cR from top o 

1 cR from top o: 

5 cR from top o 
5 cR from top o 

2 cR from top o 
0 cR from top o 
0 cR from top o 
5 cR from top oi 

9 cR from top o1 



Chr3 link 
Chr3 link 
Chr3 link 
Chr3 link 



Chr3 li 
Chr3 I 
Chr3 I 
Chr3 I 
Chr3 1 
Chr3 I 
Chr3 I 
Chr3 I 
Chr3 I 
Chr3 I 
.Chr3 I 
Chr3 I 
Chr3 I 
Chr3 I 
Chr3 I 
Chr3 I 
Chr3 I 
Chr3 I 
Chr3 I 
Chr3 li 
*Chr3 li 
Chr3 li 
Chr3 I 
Chr3 li 
Chr3 li 



nk 
nk 
nk 
nk 
nk 
nk 
nk 
nk 
nk 
nk 
nk 
nk 
nk 
nk 
nk 
nk 
nk 
nk 
nk 
nk 
nk 
nk 
nk 
nk 
nk 



Chr3 link 



1852 

2917 

1351 

1407 

1819 

2571 

1305 

2114 

1463 

3847 

1887 

1888- 

3472 

3473 

1161 

1383 

1470 

1471 

1587 

1627 

3446 

1032 

2234 

2235 

868 
1003 
2342 
4560 
4587 

741 
3839 
2855 
3085 
1753 
3354 
2440 
2358 
2905 

788 
2185 
2186 
2830 
2831 
2379 
2380 
2656 
2207 
2208 
2217 
2461 



WIAF 
WIAF 
WIAF 
WIAF 
WIAF 
WIAF 
WIAF 
WIAF 
WIAF 
WIAF 
WIAF 
WIAF 
WIAF 
WIAF 
WIAF 
WIAF 
WIAF 
WIAF 
WIAF 
WIAF 
WIAF 
WIAF 
WIAF 
WIAF 
WIAF 
WIAF 
WIAF 
HU-( 
HU- 
WIAF 
WIAF 
WIAF 
WIAF 
WIAF 
WIAF 
WIAF 
WIAF 
WIAF 
WIAF 
WIAF 
WIAF 
WIAF 
WIAF 
WIAF 
WIAF 
WIAF 
WIAF 
WIAF 
WIAF 
WIAF 



CHINA 
CHINA 



WIAF-3791 
WIAF-816 
WIAF-3254 
WIAF-3327 . 
WIAF-3758 
WIAF-201 
WlAF-2194 
WIAF-236 
WIAF-3398 
WIAF-2626 
WIAF-3826 
WIAF-3827 
WIAF-1927 
WIAF- 1928 
WIAF-1664 
WiAF-3288 
WIAF-3405 
WIAF-3406 
WIAF-3526 
WlAF-3566 
WIAF-1901 
WIAF-4092 
WIAF-669 
WIAF-670 
WIAF-1081 
WIAF-4026 
WIAF-2572 
I 1-1176-2 
I 3-1176 
WIAF-1176 
WIAF-2612 
WIAF-745 
WIAF'986 
WIAF-3692 
WIAF-1807 
WIAF-25 
WIAF-2606 
WIAF-803 
WIAF-2056 
WIAF-568 
WIAF-569 
WIAF-650 
WIAF-651 
WIAF-2664 
WIAF-2665 
WIAF-330 
WIAF-625 
WIAF-626 
WIAF-639 
WIAF-52 
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10 



15 



20 



25 



30 



35 



40 



45 



50 



3 
3 
3 
3 
3 
3 
3 
3 
3 
3 
3 
3 
3 
3 
3 
3 
3 
3 
3 
3 
3 
3 
3 
3 
3 
3 
3 
3 
3 
3 
3 
3 
3 
3 
3 
3 
3 
3 
3 
3 
3 
3 
3 
3 
3 
3 
3 
3 
3 
3 



791.4 
792.2 
793.4 
793,7 
796.7 
802.4 
808.9 
838.9 
838.9 
842.9 
848.1 
848.1 
848.1 
868.2 
868.6 
879.8 
879.8 
879.8 
896.5 
897.8 
903.2 
907.0 
907.0 
917.9 
918.0 
921.8 



cR from top 
cR from top 
cR from top 
cR from top 
cR from top 
cR from top 
cR from top 
cR from top 
cR from top 
cR from top 
cR from top 
cR from top 
cR from top 
cR from top 
cR from top 
cR from top 
cR from top 
cR from top 
cR from top 
cR from top 
cR from top 
cR from top 
cR from top 
cR from top 
cR from top 
cR from top 



of Chr3 
of Chr3 
of Chr3 
of Chr3 
of Chr3 
of Chr3 
of Chr3 
of Chr3 
of Chr3 
of Chr3 
of Chr3 
of Chr3 
of Chr3 
of Chr3 
of Chr3 
of Chr3 
of Chr3 
of Chr3 
of Chr3 
of Chr3 
of Chr3 
of Chr3 
of Chr3 
of Chr3 
of Chr3 
of Chr3 



link 
link 
link 
link 
link 
link 
link 
link 
link 
link 
link 
link 
link, 
link 
link 
link 
link 
link 
link 
link 
link 
link 
link 
link 
link 
link 



2574 
1256 
2948 
2779 
2788 
2173 
2246 
2604 
2605 
2703 
2630 
2631 
2632 
3814 
3366 
224 
225 
766 
3333 
3451 
3360 
2513 
2514 
3078 
2543 
3106 
4225 
3998 
3999 
4138 
4067 
4147 
4151 
4156 
4162 
3970 
4089 
4097 
4106 
4012 
3974 
4107 
4035 
3144 
3146 
3530 
3740 
2061 
1500 
1673 



WIAF 

WIAF 

WIAF 

WIAF 

WIAF 

WIAF 

WIAF 

WIAF 

WIAF 

WIAF 

WIAF 

WIAF 

WIAF 

WIAF 

WIAF 

KWOK 

KWOK 

WIAF 

WIAF 

WIAF 

WIAF 

WIAF 

WIAF 

WIAF 

WIAF 

WIAF 



WIAF-204 

WIAF-2143 

WIAF-847 

WIAF-523 

WIAF-542 

WIAF-543 

WIAF-690 

WIAF-249 

WIAF-250 

WIAF-392 

WIAF-290 

WlAF-291 

WIAF-292 

WIAF-2568 

WIAF-1819 

I D3S2344-1 

I D3S2344-2 

WIAF-1365 

WIAF-1786 

WIAF-1906 

WIAF-1813 

WIAF-119 

WIAF-120 

WIAF-979 

WIAF-162 

WIAF-1007 



MARSHFIELD I MID-17 



SHGC/AFFYMETRIX 

SHGC/AFFYMETRIX 

SHGC/AFFYMETRIX 

SHGC/AFFYMETRIX 

SHGC/AFFYMETRIX 

SHGC/AFFYMETRIX 

SHGC/AFFYMETRIX 

SHGC/AFFYMETRIX 

SHGC/AFFYMETRIX 

SHGC/AFFYMETRIX 

SHGC/AFFYMETRIX 

SHGC/AFFYMETRIX 

SHGC/AFFYMETRIX 

SHGC/AFFYMETRIX 

SHGC/AFFYMETRIX 

SHGC/AFFYMETRIX 

WIAF I WIAF-1045 

WIAF I WIAF-1047 

WIAF I WIAF-1985 

WIAF I WIAF-2468 

WIAF I WIAF-2547 

WIAF I WIAF-3439 

WIAF I WIAF-3612 



SNP- 
SNP- 
SNP. 
SNP. 
SNP- 
SNP. 
SNP- 
SNP- 
SNP- 
SNP- 
SNP- 
SNP- 
SNP- 
SNP- 
SNP- 
SNP- 



SHGC 
SHGC 
SHGC- 
SHGC 
SHGC- 
SHGC- 
SHGC 
SHGC- 
SHGC- 
SHGO 
SHGC 
SHGC 
SHGC- 
SHGC- 
SHGC- 
SHGC- 



11665 

1204 

13087 

13482 

14087 

14182 

14457 

14769 

16777 

17103 

18889 

32258 

3249 

33980 

35481 

7204 
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10 



15 



20 



25 



30 



35 



40 



45 



50 



3 
3 
3 
3 
3 

4 

4 

4 

4 

4 

4 

4 

4 

4 

4 

4 

4 

4 

4 

4 

4 

4 

4 

4 

4 

4 

4 

4 

4 

4 

4 

4 

4 

4 

4 

4 

4 

4 

4. 

4 

4 

4 

4 

4 

4 

4 

4 

4 

4 



3925 WIAF 

2996 WIAF 

3006 WIAF 

3017 WIAF 

3048 WIAF 



WIAF-4013 

WIAF-896 

WIAF-906 

WIAF-917 

WIAF-949 



3.70 cR from top of Chr4 linka 
3.70 cR from top of Chr4 linka 
4.4 cR from top of Chr4 linkag 
4,70 cR from top of Chr4 linka 
4.70 cR from top of Chr4 linka 
4.70 cR from top of Chr4 linka 
4.70 cR from top of Chr4 linka 
5.30 cR from top of Chr4 linka 
8.60 cR from top of Clir4 linka 

1 5.00 cR from top of Chr4 link 

16.1 cR from top of Chr4 linka 
18.70 cR from top of Chr4 link 

19.8 cR from top of Chr4 linka 
22.00 cR from top of Chr4 link 
26.80 cR from top of Chr4 link 
26.80 cR from top of Chr4 link 

27.7 cR from top of Chr4 linka 
28.20 cR from top of Chr4 link 
28.90 cR from top of Chr4 link 
29.80 cR from top of Chr4 link 
29.80 cR from top of Chr4 link 
35.40 cR from top of Chr4 link 
36.90 cR from top of Chr4 link 
39.3 cR from top of Chr4 linka 
43.50 cR from top of Chr4 link 

45.9 cR from top of Clir4 linka 
51.00 cR from top of Chr4 link 
51.90 cR from top of Chr4 link 

55.2 cR from top of Chr4 linka 
55.2 cR from top of Chr4 linka 
55.2 cR from top of Chr4 linka 
55.2 cR from top of Chr4 linka 
55.2 cR from top of Chr4 linka 
58.40 cR from top of Chr4 link 

61.8 cR from top of Chr4 linka 
65.2 cR from top of Chr4 linka 
74.30 cR from top of Chr4 link 
78.50 cR from top of Chr4 link 
80.70 cR from top of Chr4 link 
91 .50 cR from top of Chr4 link 
91.70 cR from top of Chr4 link 
96.60 cR from top of Chr4 link 
98.5 cR from top of Chr4 linka 
98.5 cR from top of Chr4 linka 



1204 


WIAF 1 


WIAF-2091 


1919 


WIAF 1 


WIAF-3858 


3866 


WIAF 1 


WIAF-2660 


3215 


WIAF 1 


WIAF-1 591 


3216 


WIAF 1 


WIAF-1592 


3217 


WfAF 1 


WIAF-1 593 


1210 


WIAF 1 


WIAF-2097 

»¥ *bWW / 


1 120 


WIAF 1 


WIAF-1 555 


3233 


WIAF 1 


WIAF-1 639 

V V 1 r^f 1 V w W 


1332 


WIAF 1 


WIAF-2222 


3809 


WIAF 


WIAF-2561 


1307 


WIAF 1 


WIAF-2196 


3503 


WIAF 1 


WIAF-1 958 


3250 


WIAF 1 


WIAF-1 668 


1811 


WIAF 1 


WIAF-3750 


1814 


WIAF 1 


WIAF-3753 


2070 


WIAF 1 


WIAF-2557 


3161 


WIAF 1 


WlAF-1464 


3555 


WIAF 1 


WIAF-20ld 


3507 


WIAF 1 


WIAF-1 962 

VVI/AK 1 ww^ 


3508 


WIAF 1 


WIAF-1 963 

V V 1 1 1 W U w 


'^1 fi*^ 


WIAF 


WIAF-1 dRR 


1 520 


WIAF 


WIAF-3459 

WIAAf w^ww 


2076 


WIAF 


WIAF-1 4 


707 


WIAF 1 


WIAF-1 395 

vvi/^i 1 www 


3003 


WIAF 1 


WIAF-903 

VVI/^l www 


3470 


WIAF 1 


WIAF-1 925 

wVI/^I 1 w^w 


2021 


WIAF 1 


WIAF-1 527 


231 


KWOK 


1 D4S2341 


2092 


WIAF 1 


WIAF-1 09 


780 


WIAF 1 


WIAF-1 433 


2340 


WIAF 1 


WIAF-2566 


806 


WIAF 1 


WIAF-4142 


1475 


WIAF 1 


WIAF-3412 


2210 


WIAF 1 


WIAF-630 


2129 


WIAF 1 


WIAF-316 


1461 


WIAF 1 


WIAF-3395 


3131 


WIAF 1 


WIAF-1 032 


672 


WIAF 1 


WIAF-1 370 


1145 


WIAF 1 


WIAF-1 611 


1433 


WIAF 1 


WIAF-3361 


1097 


WIAF 1 


WIAF-1 485 


3520 


WIAF 1 


WIAF-1 975 


3521 


WIAF 1 


WIAF-1 976 
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4 98.80 cR from top of Chr4 link 1 226 

4 98.80 cR from top of Chr4 link 1227 

4 106.40 cR from top of Chr4 lin 1826 

4 1 08.70 cR from top of Chr4 lin 703 

5 4 11 2. 1 0 cR from top of Chr4 lin 1 252 

4 120.20 cR from top of . Chr4 lin 1304 

4 1 21 .00 cR from top of Chr4 lin 201 9 

4 1 21 .40 cR from top of Chr4 lin 31 65 

4 1 21 .50 cR from top of Chr4 lin 1 348 

10 4 1 21 .50 cR from top of Chr4 lin 1 871 

4 1 21 ,50 cR from top of Chr4 lin 1 872 

4 1 22.60 cR from top of Chr4 lin 1 575 

4 1 22.60 cR from top of Chr4 lin 1576 

4 1 24.40 cR from top of Chr4 lin 1 354 

15 4 1 25.80 cR from top of Chr4 lin 1 879 

4 1 25.80 cR from top of Chr4 lin 1 882 

4 126.10 cR from top of Chr4 lin 1953 

4 1 28.3 cR from top of Chr4 link 2467 

4 139.80 cR from top of Chr4 lin 1906 

20 4 140.70 cR from top of Chr4 lin 1966 

4 1 41 .70 cR from top of Chr4 lin 1 466 

4 144.1 cR from top of Chr4 (ink 3129 

4 145.50 cR from top of Chr4 )in 1093 

4 1 46.00 cR from top of Chr4 lin 2902 

25 4 148.40 cR from top of Chr4 lin 1135 

4 1 48.60 cR from top of Chr4 lin 2464 

4 1 49.40 cR from top of Chr4 lin 1 362 

4 149.40 cR from top of Chr4 lin 1364 

4 153 cM 4306 

30 4 1 53.60 cR from top of Chr4 lin 3535 

4 174.10 cR from top of Chr4 lin 1479 

4 182.20 cR from top of Chr4 lin 1821 

4 1 93.30 cR from top of Chr4 lin 1 353 

4 1 93.8 cR from top of Chr4 link 1 984 

35 4 1 97.2 cR from top of Chr4 link 291 3 

4 199.00 cR from top of Chr4 lin 3219 

4 199.10 cR from top of Chr4 lin 1618 

4 199.70 cR from top of Chr4 lin 1865 

4 243.0 cR from top of Chr4 link 2739 

40 4 401.1 cR from top of Chr4 link 2152 

4 401.1 cR from top of Chr4 link 2153 

4 41 2.2 cR from top of Chr4 link 3031 

4 41 5.9 cR from top of Chr4 link 4564 

4 41 5.9 cR from top of Chr4 link 2568 

45 4 41 9.4 cR from top of Chr4 link 880 

4 426.7 cR from top of Chr4 link 982 

4 474.6 cR from top of Chr4 link 3019 

4 474.6 cR from top of Chr4 link 3020 

4 483.0 cR from top of Chr4 link 271 2 

50 4 497.5 cR from top of Chr4 link 3981 



WIAF 

WIAF 

WiAF 

WIAF 

WIAF 

WIAF 

WIAF 

WIAF 

WIAF 

WIAF 

WIAF 

WIAF 

WIAF 

WIAF 

WIAF 

WIAF 

WIAF 

WIAF 

WIAF 

WIAF 

WIAF 

WIAF 

WIAF 

WIAF 

WIAF 

WIAF 

WIAF 

WIAF 

UWGC 

WIAF 

WIAF 

WIAF 

WIAF 

WIAF 

WIAF 

WIAF 

WIAF 

WIAF 

WIAF 

WIAF 

WIAF 

WIAF 



WIAF 
WIAF 
WIAF 
WIAF 
WIAF 
WIAF 



WIAF-2113 

WIAF-2114 

WIAF-3765 

WIAF-1333 

WIAF-2139 

WIAF-2193 

WIAF-1524 

WIAF-1469 

WIAF-3251 

WIAF-3810 

WIAF-3811 

WIAF-3514 

WIAF-3515 

WIAF-3257 

WIAF-3818 

WIAF-3821 

WIAF-3892 

WIAF-60 

WIAF-3845 

WIAF-3905 

WIAF-3401 

WIAF-1030 

WIAF- 1475 

WIAF-800 

WIAF-1589 

WIAF.57 

WIAF-3265 

WIAF-3267 

I 125 

WIAF~1990 

WIAF-3418 

WIAF-3760 

WIAF-3256 

WIAF-3923 

WIAF-812 

WIAF-1597 

WIAF-3557 

WIAF-3804 

WIAF-452 

WIAF-482 

WIAF-483 

WIAF-931 



HU-CHINA 1 4-197 



WIAF-197 

WIAF-1116 

WIAF-1408 

WIAF-919 

WIAF-920 

WIAF-407 



SHGC/AFFYMETRIX | SNP-SHGC-51 763 
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4.99 2 cR from too 
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nk 
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4 


BOS 3 cR from too 


of Chr4 1 


nk 


2815 




4 


508 3 cR from too 


of Chr4 I 

^/ 1 II r 1 


nk 


2816 
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R22 1 rR from too 


of Chr4 1 


nk 


2756 


5 


4 


R?'^ Q rR from too 


of Chr4 1 


nk 


2688 




4 


B26 7 rR from too 


of Chr4 1 

\J 1 WI II 1 


nk 


2914 




4. 


R'^'^ 1 rR from toti 


of Chr4 1 


nk 


4181 




A 
*r 


1 rR frnm ton 


of Chr4 1 


nk 


785 




*T 


R'^R 1 rR frnm ton 


of Chr4 1 


nk 


1312 


10 


A 
*+ 


RA'^ 1 rR frnm tnn 


of Chr4 1 


nk 


729 






Rfi'^ q rR frnm tnn 


of Chr4 ! 


nk 


2179 




A 


R72 A rR frnm tnn 


of Chr4 1 


nk 


3300 




A 

*T 


R72 Q rR frnm tnn 


of Chr4 1 


nk 


2344 






Rn2 7 rR frnm tnn 


of Chr4 1 


nk 


2995 


15 


4 


626 5 cR from too 


of Chr4 1 


nk 


2094 




4 


626 S fiR from too 


of Chr4 1 


nk 


2095 




4 


631 4 rR from too 


of Chr4 1 


nk 


2364 






6*^1 A rR frnm tnn 


of Chr4 1 


nk 


2365 




A 

*T 


fiA9 1 rR frnm tnn 


of Chr4 1 


nk 


2074 


20 


A 


6AA 6 rR frnm tnn 

\J*T*T»\J l*n II Villi LU|J 


of Chr4 1 


nk 


3823 




A 


C4 A A /«Q fmm tnn 


of Chr4 link 


3826 




A 


An 






3986 




A 


An 






3987 




A 


An 
^P 






3QQ1 


9R 


A 


An 






3Q99 




A 


An 






3QQd 




A 


An 
^P 










*f 


Art 

4p 










A 


An 








ow 


A 


An 
^P 










A 








A9R9 




A 








4.1 1 9 




A 








4.1 91 




A 








4.nRR 




A 








4nRfi 




A 








4.1 99 




A 








dnR7 












A1 9*5 




A 








A1 9A 


40 


4 








4125 




4 








4126 




4 








4129 




4 








4131 




4 








4061 


45 


4 








4062 




4 








4132 




4 








4063 




4 








4133 




4 








4134 


50 


4 








4135 



WIAF 
WIAF 
WIAF 
WIAF 
WIAF 
WIAF 



WIAF-524 
WIAF-610 
WIAF-611 
WIAF-484 
WIAF-373 
WIAF-813 



SHGC/AFFYMETRIX | SNP-SHGC-50672 
WIAF I WIAF-2048 
WIAF I WIAF-2201 
WIAF I WIAF-1078 
WIAF I WIAF-561 
WIAF I WIAF- 1753 
WIAF I WIAF-2575 
WIAF I WIAF-895 
WIAF I WIAF-121 
WIAF I WIAF-122 
WIAF I WIAF-2621 
WIAF I WIAF-2623 
WIAF I WIAF-8 
WIAF I WIAF-2587 
WIAF I WIAF-2591 
SHGC/AFFYMETRIX 
SHGC/AFFYMETRIX 
SHGC/AFFYMETRIX 
SHGC/AFFYMETRIX 
SHGC/AFFYMETRIX 
SHGC/AFFYMETRIX 
SHGC/AFFYMETRIX 
SHGC/AFFYMETRIX 
SHGC/AFFYMETRIX 
MARSHFIELD | MID-7 
SHGC/AFFYMETRIX 
SHGC/AFFYMETRIX 
SHGC/AFFYMETRIX 
SHGC/AFFYMETRIX 
SHGC/AFFYMETRIX 
SHGC/AFFYMETRIX 
SHGC/AFFYMETRIX 
SHGC/AFFYMETRIX 
SHGC/AFFYMETRIX 
SHGC/AFFYMETRIX 
SHGC/AFFYMETRIX 
SHGC/AFFYMETRIX 
SHGC/AFFYMETRIX 
SHGC/AFFYMETRIX 
SHGC/AFFYMETRIX 
SHGC/AFFYMETRIX 
SHGC/AFFYMETRIX 
SHGC/AFFYMETRIX 
SHGC/AFFYMETRIX 



SNPA-SHGC4-1659 
SNPA-SHGC-51324 
SNPB-SHGC4-1659 
SNPB-SHGC-51324 
SNPC-SHGC-51324 
SNP-SHGC4-1525 
SNP-SHGC-51310 
SNP-SHGC-51312 
SNP-SHGC-51346 

SNPA-SHQC-14934 
SNPA-SHGC-24080 
SNPA-SHGC-50187 
SNPA-SHGC-50252 
SNPA-SHGC-50922 
SNPA-SHGC-50928 
SNPA-SHGC-51072 
SNPA-SHGC-51160 
SNPA-SHGC-51438 
SNPA-SHGC-51690 
SNPB-SHGC-14934 
SNPB-SHGC-24080 
SNPB-SHGC-50187 
SNPB-SHGC-50252 
SNPB-SHGC-50922 
SNPB-SHGC-50928 
SNPB-SHGC-51072 
SNPB-SHGC-51160 
SNPB-SHGC-51438 
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4 4136 SHGC/AFFYMETRIX rSNPB-SHGC-51690 

4 3958 SHGC/AFFYMETRIX | SNP-SHGC-10699 

4 4005 SHGC/AFFYMETRIX | SNP-SHGC-1 3008 

4 4150 SHGC/AFFYMETRIX | SNP-SHGC-141 39 

5 4 4077 SHGC/AFFYMETRIX | SNP-SHGC-16028 

4 4091 SHGC/AFFYMETRIX ( SNP-SHGC-1 7200 

4 4167 SHGC/AFFYMETRIX | SNP-SHGC-23754 

4 '4169 SHGC/AFFYMETRIX | SNP-SHGC-24086 

4 4170 SHGC/AFFYMETRIX | SNP-SHGC-24090 

10 4 4171 SHGC/AFFYMETRIX I SNP-SHGC-25057 

4 4172 SHGC/AFFYMETRIX I SNP-SHGC-25080 

4 . 4173 SHGC/AFFYMETRIX | SNP-SHGC-25091 

4 4174 SHGC/AFFYMETRIX I SNP-SHGC-251 12 

4 4175 SHGC/AFFYMETRIX | SNP-SHGC-251 84 

15 4 4017 SHGC/AFFYMETRIX I SNP-SHGC4-1 137 

4 4018 SHGC/AFFYMETRIX j SNP-SHGC4-1 459 

4 4020 SHGC/AFFYMETRIX | SNP-SHGC4-1 597 

4 4021 SHGC/AFFYMETRIX | SNP-SHGC4-1678 

4 4023 SHGC/AFFYMETRIX | SNP-SHGC4-851 

20 4 3978 SHGC/AFFYMETRIX I SNP-SHGC-501 75 

4 3979 SHGC/AFFYMETRIX | SNP-SHGC-501 77 • 

4 4109 SHGC/AFFYMETRIX | SNP-SHGC-50262 

4 3980 SHGC/AFFYMETRIX | SNP-SHGC-50274 

4 4110 SHGC/AFFYMETRIX j SNP-SHGC-50293 

25 4 4176 SHGC/AFFYMETRIX I SNP-SHGC'50311 

4 4177 SHGC/AFFYMETRIX | SNP-SHGC-50320 

4 • 41 1 1 SHGC/AFFYMETRIX 1 SNP-SHGC-50369 

. 4 4024 SHGC/AFFYMETRIX | SNP-SHGC-50475 

4 4179 SHGC/AFFYMETRIX | SNP-SHGC-50477 

30 4 4180 SHGC/AFFYMETRIX | SNP-SHGC-50629 

4 4182 SHGC/AFFYMETRIX | SNP-SHGC-50730 

4 4113 SHGC/AFFYMETRIX 1 SNP-SHGC-50803 

4 4114 SHGC/AFFYMETRIX I SNP-SHGC-50804 

4 4115 SHGC/AFFYMETRIX 1 SNP-SHGC-50810 

35 '4 4183 SHGC/AFFYMETRIX | SNP-SHGC-50857 

4 4184 SHGC/AFFYMETRIX j SNP-SHGC-50859 

4 '4185 SHGC/AFFYMETRIX j SNP-SHGC-50880 

4 4186 SHGC/AFFYMETRIX | SNP-SHGC-50921 

4 4187 SHGC/AFFYMETRIX | SNP-SHGC-50993 

40 4 4025 SHGC/AFFYMETRIX I SNP-SHGC-5 1011 

4 -4188 SHGC/AFFYMETRIX | SNP-SHGC-51 034 

4 .4189 SHGC/AFFYMETRIX | SNP-SHGC-51 046 

4 4190 SHGC/AFFYMETRIX I SNP-SHGC-51 122 

4 4191 SHGC/AFFYMETRIX 1 SNP-SHGC-51 140 

45 4 4192 SHGC/AFFYMETRIX I SNP-SHGC-51 173 

4 4193 SHGC/AFFYMETRIX I SNP-SHGC-51 187 

4 4194 SHGC/AFFYMETRIX I SNP-SHGC-51 200 

: 4 4026 SHGC/AFFYMETRIX I SNP-SHGC-51 209 

4 . 4195 SHGC/AFFYMETRIX I SNP-SHGC-51 227 

50 4 4196 SHGC/AFFYMETRIX SNP-SHGC-51 237 
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4 
4 
4 
4 
4 
4 
4 
4 
4 
4 
4 
4 
4 
4 
4 
4 
4 
4 
4 
4 
4 



FINE MAP 
LOCATION 



dbSNP HANDLE | LOCAL 
ASSAY ID SNP ID 



10 



15 



20 



25 



30 



35 



40 



45 



50 



4197 
4198 
4199 
4202 
4203 
4205 
4027 
4028 
4029 
4206 
4207 
4208 
4209 
4116 
4210 
3983 
2528 
3531 
3686 
3688 
2950 



5 0.00 cR from top of Chr5 linka 

5 5.2 cR from top of Chr5 linkag 

5 1 6.30 cR from top of Chr5 link 

5 1 6.30 cR from top of Chr5 link 

5 1 8.60 cR from top of Chr5 link 

5 1 9.50 cR from top of Chr5 link 

5 1 9.70 cR from top of Chr5 link 

5 36.8 cR from top of Chr5 linka 

5 39.10 cR from top of Chr5 link 

5 39.10 cR from top of Chr5 link 

5 44.5 cR from top of Chr5 linka 



5 


45.40 


cR 


from 


top 


of 


Chr5 


link 


5 


51.60 


cR 


from 


top 


of 


Chr5 


link 


5 


51.60 


cR 


from 


top 


of 


Chr5 


link 


5 


57.30 


cR 


from 


top 


of 


Chr5 


ink 


5 


62.80 


cR 


from 


top 


of 


Chr5 


ink 


5 


65.00 


cR 


from 


top 


of 


Chr5 


ink 


5 


69.40 


cR 


from 


top 


of 


Chr5 


link 


5 


69.40 


cR 


from 


top 


of 


Chr5 


link 


5 


79.40 


cR 


from 


top 


of 


Chr5 


ink 


5 


80.20 


cR 


from 


top 


of 


Chr5 


ink 


5 


80.30 


cR 


from 


top 


of 


Chr5 


link 


5 


82.30 


cR 


from 


top 


of 


Chr5 


link 


5 


82.80 


cR 


from 


top 


of 


Chr5 


link 


5 


82.80 


cR 


from 


top 


of 


Chr5 


ink 


5 


84.10 


cR 


from 


top 


of 


Chr5 


ink 


5 


84.10 


cR 


from 


top 


of 


Chr5 


ink 



3349 
2285 
3330 
3331 
1359 
1410 
2013 
2953 
1810 
1813 
3076 
1621 
1105 
1415 
1464 
1636 
3148 
1986 
1987 
3414 
1512 
1665 
3010 
3249 
1514 
1591 
1605 



SHGC/AFFYMETRIX 
SHGC/AFFYMETRIX 
SHGC/AFFYMETRIX 
SHGC/AFFYMETRIX 
SHGC/AFFYMETRIX 
SHGC/AFFYMETRIX 
SHGC/AFFYMETRIX 
SHGC/AFFYMETRIX 
SHGC/AFFYMETRIX 
SHGC/AFFYMETRIX 
SHGC/AFFYMETRIX 
SHGC/AFFYMETRIX 
SHGC/AFFYMETRIX 
SHGC/AFFYMETRIX 
SHGC/AFFYMETRIX 
SHGC/AFFYMETRIX 



WIAF 
WIAF 
WIAF 
WiAF 
WIAF 



WIAF 
WIAF 
WIAF 
WIAF 
WIAF 
WIAF 
WIAF 
WIAF 
WIAF 
WIAF 
WIAF 
WIAF 
WIAF 
WIAF 
WIAF 
WIAF 
WIAF 
WIAF 
WIAF 
WIAF 
WIAF 
WIAF 
WIAF 
WIAF 
WIAF 
WIAF 
WIAF 



WIAF- 138 
WIAF-1986 
WIAF-2414 
WIAF-241 6 
WlAF-849 



WIAF- 
WIAF- 
WIAF- 
WIAF- 
WIAF- 
WIAF- 
WIAF- 
WIAF- 
WIAF- 
WIAF- 
WIAF- 
WIAF- 
WIAF- 
WIAF- 
WIAF- 
WIAF- 
WIAF- 
WIAF- 
WIAF- 
WIAF- 
WIAF- 
WIAF- 
WIAF- 
WIAF- 
WIAF- 
WIAF- 
WIAF- 



1802 

1331 

1783 

•1784 

-3262 

3331 

1507 

•852 

3749 

3752 

977 

3560 

1532 

3342 

3399 

3575 

1049 

3925 

3926 

1869 

3451 

3604 

910 

1667 

3453 

3530 

3544 



SNP-SHGC 
SNP-SHGC 
SNP-SHGC 
SNP'SHGC 
SNP-SHGC 
SNP-SHGC 
SNP-SHGC 
SNP-SHGC 
SNP-SHGC 
SNP-SHGC 
SNP-SHGC- 
SNP-SHGC 
SNP-SHGC 
SNP-SHGC 
SNP-SHGC 
SNP-SHGC 



51240 
51242 
51249 
51323 
51340 
51387 
51411 
51435 
51467 
51477 
51520 
51554 
51579 
51662 
51721 
9709 
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5 86.10 cRfromtopofChr5 link 3180 WIAF 

5 87.20 cRfromtopofChr5 link 1525 WIAF 

5 92.30 cR from top of Chr5 link 1608 WIAF 

5 93,80 cR from top of Chr5 link 1614 WIAF 

5 5 97.90 cR from top of Chr5 link 1515 WIAF 

5 97.90 cR from top of Chr5 link 1 51 7 WIAF 

5 103.0 cR from top of Chr5 link 21 13 WIAF 

5 104.50cRfromtopof ChrSlin 2033 WIAF 

5 104.50 cR from top of Chr5 lin 3367 WIAF 

10 5 104.50 cR from top of Chr5 lin 3368 WIAF 

5 104.50 cRfromtopofChr5 lin 3369 WIAF 

5 109.00 cR from top of Chr5 lin 3107 WIAF 

5 1 1 7.3 cR from top of Chr5 link 2229 WIAF 

5 1 17.3 cR from top of Chr5 link 2230 WIAF 

15 5 121.60 cR from top of Chr5 lin 997 WIAF 

5 1 22.30 cR from top of .Chr5 lin 1 221 WIAF 

5 1 22.30 cR from top of Chr5 lin 1 222 WIAF 

5 1 22,60 cR from top of Chr5 lin 1 939 WIAF 

5 124.3 cR from top of Chr5 link 3145 WIAF 

20 5 131 .7 cR from top of Chr5 link 3835 WIAF 

5 131.7 cR from top of Chr5 link 3836 WIAF 

5 132.90 cR from top of Chr5 lin 1832 WIAF 

5 140.90 cR from top of Chr5 lin 1215 WIAF 

5 141 .00 cR from top of Chr5 lin 2577 WIAF 

25 5 1 41 .40 cR from top of Chr5 lin 1 467 WIAF 

5 142.30 cR from top of Chr5 lin 1620 WIAF 

5 .144.10cRfromtopofChr5 lin 3118 WIAF 

5 153.50 cRfromtopofChr5 lin 3282 WIAF 

5 156.10 cR from top of Chr5 lin 1902 WIAF 

30 5 1 56.40 cR from top of Chr5 lin 1849 WIAF 

5 163.00 cRfromtopofChr5 lin 1272 WIAF 

5 163.00cRfromtopof ChrSltn 1273 WIAF 

5 163.00cRfromtopofChr5 lin 1274 WIAF 

5 169.80 cR from top of Chr5 lin 1788 WIAF 

35 5 169.80 cRfromtopofChr5 lin 1790 WIAF 

5 182.0cRfromtopofChr5 link 2478 WIAF 

5 1 86.40 cR from top of Chr5 lin 691 WIAF 

5 187.9cRfromtopofChr5 link 3343 WIAF 

5 194.00 cRfromtopofChr5 lin 1918 WIAF 

40 5 195.80 cR from top of Chr5 lin 1323 WIAF 

5 261 .7 cR from top of Chr5 link 3269 WIAF 

5 266.6 cR from top of Chr5 link 2575 WIAF 

5 282.2 cR from top of Chr5 link 2805 WIAF 

5 310.9 cR from top of Chr5 link 2860 WIAF 

45 5 334.8 cR from top. of Chr5 link 2666 WIAF 

5 334.8 cR from top of Chr5 link 2667 WIAF 

5 334.8 cR from top of Chr5 link 2668 WIAF 

5 351 .7 cR from top of Chr5 link 2792 WIAF 

5 357.7 cR from top of Chr5 link 2890 WIAF 

50 5 368.6 cR from top of Chr5 link 3598 WIAF 



WIAF- 
WIAF- 
WIAF- 
WIAF' 
WIAF- 
WIAF- 
WIAF- 
WIAF- 
WIAF- 
WIAF- 
WIAF- 
WIAF- 
WIAF- 
WIAF- 
WIAF- 
WIAF- 
WIAF- 
WIAF- 
WIAF- 
WIAF- 
WIAF- 
WIAF- 
WIAF- 
WIAF- 
WIAF- 
WIAF- 
WIAF- 
WIAF- 
WIAF- 
WIAF- 
WIAF- 
WIAF- 
WIAF- 
WIAF- 
WIAF- 
WIAF- 
WIAF- 
WIAF- 
WIAF- 
WIAF- 
WIAF- 
WIAF- 
WIAF- 
WIAF- 
WIAF- 
WIAF- 
WIAF- 
WIAF- 
WIAF- 
WIAF- 



1496 

3464 

3547 

3553 

3454 

3456 

235 

1594 

1820 

1821 

1822 

1008 

662 

663 

2053 

2108 

2109 

3878 

1046 

2608 

2609 

3771 

2102 

209 

3402 

3559 

1019 

1735 

3841 

3788 

2159 

2160 

2161 

3727 

3729 

74 

1220 

1796 

3857 

2213 

1721 

205 

575 

755 

346 

347 

348 

548 

788 

2267 
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10 



15 



20 



25 



30 



35 



40 



45 



50 



5 
5 
5 
5 
5 
5 
5 
5 
5 
5 
5 
5 
5 
5 
5 
5 
5 
5 
5 
5 
5 
5 
5 
5 
5 
5 
5 
5 
5 
5 
5 
5 



6 
6 
6 



6 
6 
6 
6 
6 
6 



368.6 
372.6 
378.7 
401.5 
406.7 
425.1 
425.1 
431.5 
431.5 
437.0 
441.2 
500.0 
510.2 
532.5 
532.5 
532.5 
532.7 
534.1 
537.3 
537.4 
569.8 



cR from 
cR from 
cR from 
cR from 
cR from 
cR from 
cR from 
cR from 
cR from 
cR from 
cR from 
cR from 
cR from 
cR from 
cR from 
cR from 
cR from 
cR from 
cR from 
cR from 
cR from 



top of 
top of 
top of 
top of 
top of 
top of 
top of 
top of 
top of 
top of 
top of 
top of 
top of 
top of 
top of 
top of 
top of 
top of 
top of 
top of 
top of 



Chr5 link 
Chr5 link 
Chr5 link 
Chr5 link 
Chr5 link 
Chr5 link 
Chr5 link 
Chr5 link 
Chr5 link 
Chr5 link 
Chr5 link 
Chr5 link 
Chr5 link 
Chr5 link 
Chr5 link 
Chr5 link 
Clir5 link 
Chr5 link 
Chr5 link 
Chr5 link 
Chr5 link 



0.0 cR from top of Chr6 linkag 
1 .40 cR from top of Chr6 linka 
1 .40 cR from top of Chr6 linka 
1 .40 cR from top of ChrB linka 
1 .40 cR from top of Chr6 linka 
1 .40 cR from top of Chr6 linka 
1 .6 cR from top of Chr6 linkag 
6.40 cR from top of Chr6 linka 
9.80 cR from top of Chr6 linka 
9.80 cR from top of ChrB, linka 
9.80 cR from top of Chr6 linka 
1 7.70 cR from top of Chr6 link 
17.80 cR from top of Chr6 link 
17,80 cR from top of Chr6 link 
17.80 cR from top of ChrB link 
20.50 cR from top of ChrB link 



2701 
3351 
2627 
3523 
3568 
2106 
2107 
2894 
2895 
2118 
3492 
3374 
1200 
4578 
4550 
2889 
1121 
2442 
3455 
2677 
2211 
4253 
4052 
4058 
3961 
4142 
4160 
4080 
4050 
1101 
1492 
3881 



3133 
2028 
1497 
1674 
1782 
1827 
2958 
1209 
1657 
1658 
1659 
3119 
1373 
1933 
1936 
1124 



WIAF I WlAF-389 
WIAF I WlAF-1804 
WIAF I WIAF-287 
WIAF I WIAF-1978 
WIAF I WIAF-2023 
WIAF I WIAF-1 86 
WIAF I WIAF-187 
WIAF I WlAF-792 
WIAF I WIAF-793 
WIAF 1 WIAF-276 
WIAF I WIAF-1947 
WIAF I WIAF-1827 
WIAF I WIAF-2087 
HU-CHINA I 5-787 
HU-CHINA I 5-787-2 
WIAF I WIAF-787 
WIAF I WIAF-1561 
WIAF t WIAF-27 
WIAF I WIAF-1910 
WIAF ] WIAF-359 
WIAF I WIAF-631 
MARSHFIELD | MID-8 
SHGC/AFFYMETRIX 
SHGC/AFFYMETRIX 
SHGC/AFFYMETRIX 
SHGC/AFFYMETRIX 
SHGC/AFFYMETRIX 
SHGC/AFFYMETRIX 
SHGC/AFFYMETRIX 



SNPA-SHGC-16519 

SNPB-SHGC-16519 

SNP-SHGC-10972 

SNP-SHGC-13353 

SNP-SHGC-14742 

SNP-SHGC-16780 

SNP-SHGC-9420 



WIAF 
WIAF 
WIAF 



WIAF 
WIAF 
WIAF 
WIAF 
WIAF 
WIAF 
WIAF 
WIAF 
WIAF 
WIAF 
WIAF 
WIAF 
WIAF 
WIAF 
WIAF 
WIAF 



WlAF-1 520 
WIAF-3431 
WIAF-3942 



WIAF- 
WIAF- 
WIAF- 
WIAF- 
WIAF- 
WIAF- 
WIAF- 
WIAF- 
WIAF- 
WIAF- 
WIAF- 
WIAF- 
WIAF- 
WIAF- 
WIAF- 
WIAF- 



1034 

1583 

3436 

3613 

3721 

3766 

857 

2096 

3596 

3597 

3598 

1020 

3277 

3872 

3875 

15B7 
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10 



15 



20 



25 



30 



35 



40 



45 



50 



6 41 

6 41 

6 43 

6 44 



6 
6 
6 
6 
6 
6 



6 21 .30 cR from top of Chr6 link 1 551 WIAF 

6 24.50 cR from top of Chr6 link 3066 WIAF 

6 31 .7 cR from top of Chr6 linka 2366 WIAF 

6 34.20 cR from top of Chr6 link 1 109 WIAF 

6 34.90 cR from top of Chr6 link 3511 WIAF 

6 38.00 cR from top of ChrS link 1 733 WIAF 

5 cR from top of Chr6 linka 2522 WIAF 

5 cR from top of Chr6 linka 2523 WIAF 

50 cR from top of Chr6 link 3504 WIAF 

20 cR from top of Chr6 link 321 1 WIAF 

6 46.00 cR from top of Chr6 link 2003 WIAF 

6 46.00 cR from top of Chr6 link 2004 WIAF 

6 46.00 cR from top of Chrd link 2005 WIAF 

6 46.60 cR from top of Chr6 link 1116 WIAF 

60 cR from top of Chr6 link 1117 WIAF 

60 cR from top of Chr6 link 1118 WIAF 

60 cR from top of ChrS link 1119 WIAF 

60 cR from top of Chr6 link 1546 WIAF 

60 cR from top of Chr6 link 1 548 WIAF 

60 cR from top of Chr6 link 1866 WIAF 

60 cR from top of ChrS link 1992 WIAF 

70 cR from top of Chr6 link 3270 WIAF 

80 cR from top of Chr6 link 1729 WIAF 

80 cR from top of Chr6 link 1732 WIAF 

80 cR from top of Chr6 link 1735 WIAF 

,90 cR from top of ChrS link 678 WIAF 

6 47.00 cR from top of Chr6 link 3260 WIAF 

6 : 47.00 cR from top of Chr6 link 3480 WIAF 

6 47.00 cR from top of Chr6 link 1385 WIAF 

6 47.00 cR from top of Chr6 link 1601 WIAF 

6 47.00 cR from top of Chr6 link 1905 WIAF 

6 47.00 cR from top of Chr6 link 1907 WIAF 

6 47.00 cR from top of Chr6 link 2997 WIAF 

6 47.00 cR from top of Chr6 link 2998 WIAF 

6 47.10 cR from top of Chr6 link 1769 WIAF 

20 cR from top of Chr6 link 3108 WIAF 

30 cR from top of Chr6 link 1754 WIAF 

30 cR f rom top of Chr6 link 1755 WIAF 

30 cR from top of Chr6 link 1757 WIAF 

70 cR from top of Chr6 link 1219 WIAF 

80 cR from top of Chr6 link 1216 WIAF 

90 cR from top of Chr6 link 1472 WIAF 

90 cR from top of Chr6 link 1474 WIAF 

50 cM 4317 UWGC 

51 .1 cR from top of Chr6 linka 3049 WIAF 

52.80 cR from top of Chr6 link 2030 WIAF 

52.80 cR from top of ChrS link 2031 WIAF 

54.6 cR from top of 'Chr6 linka 3484 WIAF 

59.5 cR from top of Chr6 linka 2990 WIAF 

65,50 cR from top of Chr6 link 1188 WIAF 



46 
46 
46 
46 
46 
46 
6 46 
6 46 
6 46 
6 46 
6 46 
6 46 



6 47 
6 47 
6 47 
6 47 
6 47 
6 47 
6 47 
6 47 
6 
6 
6 
6 
6 
6 
6 



WIAF 
WIAF 
WIAF 
WIAF 
WIAF 
WIAF 
WIAF 
WIAF 
WIAF 
WIAF 
WIAF 
WIAF- 
WIAF- 
WIAF- 
WIAF- 
WIAF- 
WIAF- 
WIAF- 
WIAF- 
WIAF- 
WIAF- 
WIAF- 
WIAF- 
WIAF- 
WIAF- 
WIAF 
WIAF 
WIAF 
WIAF 
WIAF 
WIAF 
WIAF- 
WIAF- 
WIAF 
WIAF- 
WIAF- 
WIAF- 
WIAF- 
WIAF 
WIAF- 
WIAF- 
WIAF- 
WIAF- 
I 136 
WIAF 
WIAF- 
WIAF- 
WIAF- 
WIAF- 
WIAF- 



3490 

967 

•2627 

1541 

1966 

3672 

131 

132 

1959 

1574 

1460 

1461 

1462 

1551 

1552 

1553 

1554 

3485 

3487 

3805 

3931 

1722 

3668 

3671 

3674 

1453 

1696 

1935 

3290 

3540 

3844 

3846 

897 

898 

3708 

1009 

3693 

3694 

3696 

2106 

2103 

3409 

3411 

950 

1586 

1587 

1939 

890 

2075 



-124- 



wo 01/62968 



PCT/USOl/00452 



CHROMOSOME 



FINE MAP 
LOCATION 



dbSNP HANDLE | LOCAL 
ASSAY ID SNP ID 



6 66.60 cR from top of ChrB link 3220 WIAF 

6 66.60 cR from top of Chr6 link 3221 WIAF 

6 69.90 cR from top of ChrB link 3096 WIAF 

6 71 .3 cR from top of Chr6 linka 2981 WIAF 

5 6 79.90 cR from top of Chr6 link 1322 WIAF 

6 85.50 cR from top of Chr6 link 1850 WIAF 

6 86.40 cR from top of Chr6 link 1761 WIAF 

6 89.5 cR from top of Chr6 linka 2510 WIAF 

6 89.60 oR from top of Chr6 link 3022 WIAF 

10 6 90.50 cR from top of Chr6 link 1528 WIAF 

6 90.50 cR from top of Chr6 link 1532 WIAF 

6 93.60 cRfromtopofChr6 link 3172 WIAF 

6 95.50 cR from top of Chr6 link 3251 WIAF 

6 95.50 cR from top of Chr6 link 3252 WIAF 

15 6 95.50 cR from top of Chr6 link 1818 WIAF 

6 96.00 cRfromtopofChr6 link 1807 WIAF 

6 99.8 cR from top of Chr6 linka 2090 WIAF 

6 99.8 cR from top of Chr6 linka 209.1 WIAF 

6 102.40 cR from top of Chr6 lin 3244 WIAF 

20 6 105.40 cR from top of Chr6 lin 1783 WIAF 

6 106.40 cR from top of Chr6 lin 3167 WIAF 

6 106.90 cR from top of Chr6 lin 1417 WIAF 

6 116.20 cR from top of Chr6 lin 1809 WIAF 

6 1 1 6.60 cR from top of Chr6 (in 3420 WIAF 

25 6 11 6.60 cR from top of Chr6 lin 3421 WIAF 

6 1 1 6.60 cR from top of Chr6 lin 3422 WIAF 

6 1 1 6.60 cR from top of .Chr6 lin 3423 WIAF 

6 1 1 6.60 cR from top of Chr6 lin 3424 WIAF 

6 1 1 6.60 cR from top of Chr6 lin 3425 WIAF 

30 . 6 1 16.60 cR from top of Chr6 lin 3426 WIAF 

6 1 1 6.60 cR from top of Chr6 lin 1 329 WIAF 

6 1 1 6.90 cR from top of Chr6 lin 1 239 WIAF 

6 1 1 7.00 cR from top of Chr6 lin 1582 WIAF 

6 1 24.20 cR from top of Chr6 lin 1284 WIAF 

35 6 125.4 cR from top of Chr6 link 3441 WIAF 

6 1 25,4 cR from top of Chr6 link 3i328 WIAF 

6 125.4 cR from top of Chr6 link 2716 WIAF 

6 1 25,40 cR from top of Chr6 lin 1 592 WIAF 

6 1 25,80 cR from top of Chr6 lin 3254 WIAF 

40 6 131.20cRfromtopofChr6 lin 1535 WIAF 

6 1 37.90 cR from top of Chr6 lin 1706 WIAF 

6 144.50 cR from top of Chr6 lin 1 141 WIAF 

6 144.50cRfromtopofChr6 lin 1860 WIAF 

6 144.90cRfromtopofChr6 lin 3418 WIAF 

45 6 145.60 cR from top of Chr6 lin 3579 WIAF 

6 147.30 cR from top of Chr6 lin 1771 WIAF 

6 1 50.40 cR from top of Chr6 lin 1 1 50 WIAF 

6 1 50.40 cR from top of Chr6 lin 1 1 76 WIAF 

6 1 54.90 cR from top of Chr6 lin 1 563 WIAF 

50 6 1 55.70 cR from top of Chr6 lin 1 737 WIAF 



WIAF- 
WIAF- 
WIAF. 
WIAF- 
WIAF- 
WIAF- 
WIAF- 
WIAF- 
WIAF- 
WIAF- 
WIAF- 
WIAF- 
WIAF- 
WIAF- 
WIAF- 
WIAF- 
WIAF- 
WIAF- 
WIAF- 
WIAF- 
WIAF- 
WIAF- 
WIAF- 
WIAF- 
WIAF- 
WIAF- 
WIAF- 
WIAF- 
WIAF- 
WIAF- 
WIAF- 
WIAF- 

wiaf: 

WIAF- 
WIAF- 
WIAF- 
WIAF- 
WIAF- 
WIAF- 
WIAF- 
WIAF- 
WIAF- 
WIAF- 
WIAF 
WIAF- 
WIAF- 
WIAF- 
WIAF- 
WIAF- 
WIAF- 



1608 

1609 

997 

881 

2212 

3789 

3700 

116 

922 

3467 

3471 

1486 

1669 

1670 

3757 

3746 

105 

106 

1654 

3722 

1476 

3344 

3748 

1875 

1876 

1877 

1878 

1879 

1880 

1881 

2219 

2126 

3521 

2171 

1896 

2597 

415 

3531 

1674 

3474 

3645 

1604 

3799 

1873 

2034 

3710 

1623 

1701 

3502 

3676 
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10 



15 



20 



25 



30 



35 



40 



45 



50 



6 
6 
6 
6 
6 
6 
6 
6 
6 
6 
6 
6 
6 
6 
6 
6 
6 
6 
6 
6 
6 
6 
6 
6 
6 
6 
6 
6 
6 
6 
6 
6 
6 
6 
6 
6 



6 
6 
6 
6 
6 
6 
6 
6 
6 
6 



155, 
159, 
165. 
165, 
166, 
166, 
175, 
177, 
177, 
177, 
178, 
180, 
180, 
180. 
180, 
180, 
180, 
180. 
180, 
180 
180, 
180. 
184 
187 
187 
187, 
187 
187 
188, 
188, 
189, 
190 
190, 
190, 
201 
201 
201 
203 
212 
212 
217 
218 
249 
249 
256 
275 
276 
331 
6 487 
6 576 



70 cR from top of Chr6 lin 1738 WIAF 

00 cR from top of Chr6 lin 1867 WIAF 

70 cR from top of Chr6 lin 3499 WIAF 

70 cR from top of Chr6 lin 3500 WIAF 

40 cR from top of Chr6 lin 1465 WIAF 

80 cR from top of Chr6 lin 1513 WIAF 

90 cR from top of Chre lin 301 5 WIAF 

0 cR from top of Chr6 link 2798 WIAF 

0 cR from top of Chr6 link 2799 WIAF 

0 cR from top of Chr6 link 2800 WIAF 
40 cR from top of Chr6 lin 1896 WIAF 
3 cR from top of Chr6 link 261 1 WIAF 

3 cR from top of Chr6 link 2612 WIAF 

4 cR from top of Chr6 link 951 WIAF 
4 cR from top of Chr6 link 952 WIAF 
4 cR from top of Chr6 link 953 WIAF- 
4 cR from top of Chr6 link 954 WIAF 
4 cR from top of Chr6 link 955 WIAF 
4 cR from top of Chr6 link 956 WIAF 
4 cR from top of Chr6 link 957 WIAF 
4 cR from top of Chr6 link 958 WIAF 
4 cR from top of Chr6 link 1038 WIAF 
80 cR from top of Chr6 lin 1426 WIAF 

1 cR from top of Chr6 link 3565 WIAF 

1 cR from top of Chr6 link 3566 WIAF 
7 cR from top of Chr6 link 1993 WIAF 

7 cR from top of Chr6 link -1 996 WIAF 

8 cR from top of Chr6 link 2529 WIAF 

2 cR from top of Chr6 link 1660 WIAF 
40 cR from top of Chr6 lin 1427 WIAF 

00 cR from top of Chr6 lin 1 299 WIAF 
30 cR from top of Chr6 lin 1440 WIAF 
30cRfromtopofChr6 lin 1442 WIAF 
30 cR from top of Chr6 lin 1443 WIAF 

1 cR from top of Chr6 link 1290 WIAF 
lOcRfromtopof Chreiin 1763 WIAF 
10 cR from top of Chr6 lin 1765 WIAF 

nk 2505 WIAF 

nk 3105 WIAF 

nk 3837 WIAF 

nk 3016 WIAF 

nk 3293 WIAF 

nk 857 WIAF 

nk 1063 WIAF 

nk 1845 WIAF 

nk 2499 WIAF 

nk 3327 WIAF 

nk 2907 WIAF 

nk 2809 WIAF 

nk 2296 WIAF 



,9 cR from top of Chr6 I 
.4 cR from top of Chr6 li 
.6 cR from top of Chr6 I 
.6 cR from top of Chr6 I 
.7 cR from top of Chr6 I 
.6 cR from top of Chr6 I 
.6 cR from top of Chr6 I 
2 cR from top of Chr6 li 
6 cR from top of Chr6 li 
6 cR from top of Chr6 li 
1 cR from top of Chr6 li 
0 cR from top of Chr6 11 
9 cR from top of Chr6 li 



WIAF- 
WIAF- 
WIAF- 
WIAF- 
WIAF- 
WIAF- 
WIAF- 
WIAF- 
WIAF- 
WIAF- 
WIAF- 
WIAF- 
WIAF- 
WIAF- 
WIAF- 
WIAF- 
WIAF- 
WIAF- 
WIAF- 
WIAF- 
WIAF- 
WIAF- 
WIAF- 
WIAF' 
WIAF- 
WIAF- 
WIAF- 
WIAF- 
WIAF- 
WIAF- 
WIAF" 
WIAF- 
WIAF- 
WIAF- 
WIAF- 
WIAF- 
WIAF- 
WIAF- 
WIAF- 
WIAF- 
WIAF- 
WIAF- 
WIAF- 
WIAF- 
WIAF- 
WIAF- 
WIAF- 
WIAF- 
WIAF- 
WIAF- 



3677 

3806 

1954 

1955 

3400 

3452 

915 

556 

557 

558 

3835 

257 

258 

1335 

1336 

1337 

1338 

1339 

1340 

1341 

1342 

4120 

3354 

2020 

2021 

3932 

3935 

139 

3599 

3355 

2186 

3369 

3371 

3373 

2177 

3702 

3704 

110 

1006 

2610 

916 

1746 

1060 

4188 

3784 

98 

1780 

805 

689 

2281 
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6 625.0 cR from top of Chr6 link 2084 WIAF | WIAF-78 

6 706.1 cR from top of Chr6 link 2945 WIAF | WIAF-844 

6 706.1 cR from top of Chr6 link 2946 WIAF | WIAF-845 

6 71 1 .4 cR from top of Chr6 link 2808 WIAF | WIAF-584 

5 6 729.2 cR from top of Chr6 link 4588 HU-CHINA | 6-985 

6 729.2 cR from top of Chr6 link 3084 WIAF | WIAF-985 

6 734;7 cR from top of Chr6 link 2274 WIAF | WIAF-736 

6 734.7 cR from top of Chr6 link 2275 WIAF | WIAF-737 

6 736.4 cR from top of Chr6 link 1784 WIAF | WIAF-3723 

10 6 739.6 cR from top of Chr6 link 2.158 WIAF | WIAF-492 

6 799.2 cR from top of Chr6 link 2073 WIAF | WIAF-3 

6 81 2.0 cR from top of Chr6 link 2747 WIAF | WIAF-466 

6 822.5 cR from top of Chr6 link 2193 WIAF | WIAF-592 

6 837.8 cR from top of Chr6 link 3081 WIAF | WIAF-982 

1 5 6 846.2 cR from top of Chr6 link 2386 WIAF | WIAF-2680 

6 856.6 cR from top of Chr6 link 2498 WIAF \ WIAF-97 ' 

6 858.4 cR from top of Chr6 link 2080 WIAF | WIAF-30 

6 860.5 cR from top of Chr6 link 2865 WIAF | WIAF-762 

6 4218 MARSHFIELD | MID-10 

20 6 . 4219 MARSHFIELD I MID-11 

6 4254 MARSHFIELD | MID-9 

6 4117 SHGG/AFFYMETRIX | SNPA-SHGC-1 3699 

6 3988 SHGC/AFFYMETRIX | SNPA-SHGC-6809 

6 4127 SHGC/AFFYMETRIX j SNPB-SHGC-1 3699 

25 6 3993 SHGC/AFFYMETRIX | SNPB-SHGC-6809 

6 3960 SHGC/AFFYMETRIX | SNP-SHGC-10969 

6 4002 SHGC/AFFYMETRIX | SNP-SHGC-1 2214 

6 '4149 SHGC/AFFYMETRIX | SNP-SHGC-1 41 1 1 

6 4152 SHGC/AFFYMETRIX j SNP-SHGC-1 4233 

30 6 4158 SHGC/AFFYMETRIX j SNP-SHGC-1 471 9 

6 3975 SHGC/AFFYMETRIX j SNP-SHGC-34704 

6 3977 SHGC/AFFYMETRIX | SNP-SHGC-44682 

6 4042 SHGC/AFFYMETRIX j SNP-SHGC-8858 

6 3149 WIAF 1 WIAF-1050 

35 6 1107 WIAF I WIAF-1539 

6 3257 WIAF j WIAF-1685 

6 3877 WIAF j WIAF-2678 

6 1 505 WIAF I WIAF-3444 

6 1545 WIAF 1 WIAF-3484 

40 6 1616 WIAF I WIAF-3555 

6 1781 WIAF I WIAF-3720 

6 1787 WIAF I WIAF-3726 

6 . 1789 WIAF I WIAF-3728 

6 1791 WIAF 1 WIAF-3730 

45 6 1793 WIAF I WIAF-3732 

6 1795 WIAF I WIAF-3734 

6 -1801 WIAF I WIAF-3740 

6 . 1851 WIAF I WIAF-3790 

6 * 1904 WIAF I WIAF-3843 

50 6 1932 WIAF 1 WIAF-3871 
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6 
6 
6 



1935 WIAF 

1938 WIAF 

2972 WIAF 

2973 WIAF 
3063 WIAF 
3086 WIAF 



I WIAF-3874 
I WIAF-3877 
I WIAF-872 
I WIAF-873 
I WIAF-964 
I WIAF-987 



7 2.20 cR from top of Chr7 linka 1804 

10 7 5.20 cR from top of Chr7 llnka 1 300 

7 18.10 cR from top of Chr7 link 1759 

7 19.00 cR from top of Chr7 link 1457 

7 22.00 cR from top of Chr7 link 1913 

7 26.4 cR from top of Chr7 linka 2926 

15 7 29.10 cR from top of Chr7 link 1717 

7 29.10 cR from top of Chr7 link 1796 

7 29.10 cR from top of Chr7 link 1808 

7 34.8 cR from top of Chr7 linka 3832 

7 37.3 cR from top of Chr7 linka 4579 

20 7 37.3 cR from top of Chr7 linka 4551 

7 37.3 cR from top of Chr7 linka 2669 

7 37.3 cR from top of Chr7 linka 2670 

7 39.90 cR from top of Chr7 link 1493 

7 50.00 cR from top of Chr7 link 1 526 

25 7 58.9 cR from top of Chr7 linka 774 

7 64.6 cR from top of Chr7 linka 3807 

7 70.70 cR from top of Chr7 link 2928 

7 70.70 cR from top of Chr7 link 2929 

7 71.50 cR from top of Chr7 link 1220 

30 7 77.10 cR from top of Chr7 link 1291 

7 77. 1 0 cR from top of Chr7 link 1 292 

7 83.60 cR from top of Chr7 link 1401 

7 89.0 cR from top of Chr7 linka 2616 

7 90.20 cR from top of Chr7 link 2042 

35 7 93.2 cR from top of Chr7 linka 2771 

7 93.90 cR from top of Chr? link 1949 

7 98.00 cR from top of Chr7 link 1768 

7 102.30 cR from top of Clir7 lin 1664 

7 105.20 cR from top of Chr7 lin 668 

40 7 105.20 cR from top of Chr7 lin 669 

7 106 cM 4324 

7 109.50 cR from top of Chr7 lin 4573 

7 109.50 cR from top of Chr7 lin 667 

7 109,90 cR from top of Chr7 lin 4572 

45 7 109.90 cR from top of Chr7 lin 3179 

7 1 10.9 cR from top of Chr7 link 3375 

7 110.9 cR from top of Chr7 link 3376 

7 111.60 cR from top of Chr7 lin 3168 

1 111 .60 cR from top of Chr? lin 31 69 

50 7 11 2.00 cR from top of Chr7 lin 3563 



WIAF I WIAF-3743 
WIAF I WIAF-2189 
WIAF I WIAF-3698 
WIAF 1 WlAF-3390 
WIAF I WIAF-3852 
WIAF I WIAF-825 
WIAF I WIAF-3656 
WIAF I WIAF-3735 
WIAF I WIAF-3747 
WIAF I WIAF-2601 
HU-CHINA I 7-349 
HU-CHINA 1 7-349-2 
WIAF J WIAF-349 
WIAF I WIAF-350 
WIAF I WIAF-3432 
WIAF I WIAF-3465 
WIAF I WIAF- 1405 
WIAF I WIAF-2559 
WIAF 1 WIAF-827 
WIAF I WIAF-828 
WIAF I WIAF-2107 
WIAF I WIAF-2178 
WIAF I WIAF-2179 
WIAF I WIAF-3321 
WIAF I WIAF-263 
WIAF I WIAF-1631 
WIAF I WIAF-514 
WIAF 1 WIAF-3888 
WIAF I WIAF-3707 
WIAF 1 WIAF-3603 
WIAF I WIAF-1240 
WIAF 1 WIAF-1332 
UWGC I 143 
HU-CHINA I 7-1100 
WIAF I WIAF-1100 
HU-CHINA 1 7-1495 
WIAF I WIAF-1495 
WIAF I WIAF-1828 
WIAF I WIAF-1829 
WIAF I WIAF-1477 
WIAF I WIAF-1478 
WIAF I WIAF-2018 
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7 1 1 2.00 cR from top of Chr7 lin 1 81 5 

7 1 1 2.00 cR from top of Chr7 lin 1816 

7 11 2.30 cR from top of Chr7 lin 4574 

7 1 1 2.30 cR from top of Chr7 lin 1 098 

5 7 11 2.90 cR from top of Chr7 lin 1 1 99 

7 1 1 3.40 cR from top of Chr7 lin 1 927 

7 1 1 7.20 cR from top of Chr7 lin 4593 

7 1 1 7.20 cR from top of Chr7 lin 4594 

7 1 1 7.20 cR from top of Chr7 lin 4595 

10 7 1 1 7.20 cR from top of Chr7 lin 1167 

7 1 1 7,20 cR from top of Chr7 lin 1 1 68 

7 1 1 7.20 cR from top of Chr7 lin 1 1 69 

7 1 1 9.80 cR from top of Chr7 lin 1 1 80 

7 1 22.6 cR from top of Chr7 link 2546 

15 7 125.50 cR from top of Chr7 lin 1381 

7 126.60 cR from top of Chr7 lin 1980 

7 1 26.60 cR from top of Chr7 lin 1 981 

7 129.10 cR from top of Chr7 lin 1387 

7 129.90 cR from top of Chr7 lin 1711 

20 7 135.3 cR from top of Chr7 link 3002 

7 1 36.50 cR from top of Chr7 lin 3861 

7 139.70 cR from top of Chr7 lin 1839 

7 147.7 cR from top of Chr7 link 3357 

7 150.1 cR from top of Chr7 link 3571 

25 7 1 50.1 cR from top of Chr7 link 3572 

7 1 55.00 cR from top of Chr7 lin 71 1 

7 1 55.00 cR from top of Chr7 lin 712 

7 165.60 cR from top of Chr7 lin 1777 

7 165.60 cR from top of Chr7 lin 1778 

30 7 1 69.00 cR from top of Chr7 lin 1 1 36 

7 172.90 cR from top of Chr7 lin 3227 

7 176.60 cR from top of Chr7 lin 1743 

7 182.40 cR from top of Chr7 lin 2039 

7 182.40 cR from top of Chr7 lin 1298 

35 7 183.2 cR from top of Chr7 link 3505 

7 184.00 cR from top of Chr7 lin 1727 

7 184.00 cR from top of Chr7 lin 2901 

7 187.2 cR from top of Chr7 link 2214 

7 390.2 cR from top of Chr7 link 2603 

40 7 399.5 cR from top of Chr7 link .221 5 

7 446.9 cR from top of Chr7 link 3370 

7 453.2 cR from top of Chr7 link 3060 

7 455.7 cR from top of Chr7 link 2908 

7 467.5 cR from top of Chr7 link 2769 

45 7 467.6 cR from top of Chr7 link 2828 

7 476.3 cR from top of Chr7 link 4580 

. 7 476.3 cR from top of Chr7 link 4552 

7 476.3 cR from top of Chr7 link 2502 

7 476.3 cR from top of Chr7 link 3319 

50 7 476.3 cR from top of Chr7 link 3320 



WIAF 
WIAF 
WIAF 
WIAF 



WIAF 
WIAF 
WIAF 
WIAF 



WIAF I WlAF-3754 
WIAF I WIAF-3755 
HU-CHINA I 7-1510 
WIAF I WIAF-1510 
WIAF I WIAF-2086 
WIAF I WIAF-3866 
HU-CHINA I 7-1680 
HU-CHINA I 7-1680>2 
HU-CHINA I 7-1680-3 
WIAF I WIAF-1679 
WIAF I WIAF-1680 
WIAF I WIAF-1681 
WIAF I WIAF-1710 
WIAF I WIAF-167 
WIAF I WIAF-3285 
WIAF-3919 
WlAF-3920 
WIAF-3292 
WIAF-3650 
WIAF i WlAF-902 
WIAF I WIAF-2651 . 
WIAF-3778 
WIAF-1810. 
WIAF-2026 
WIAF-2027 
WIAF j WIAF-1447 
WIAF I WIAF-1448 
WIAF I WIAF-3716 
WIAF 1 WIAF-3717 
WIAF I WIAF-1599 
WIAF I WIAF-1627 
WIAF I WIAF-3682 
WIAF I WlAF-1620 
WIAF j WIAF-2185 
WIAF I WIAF-1960 
WIAF I WIAF-3666 
WIAF I WIAF-799 
WIAF I WIAF-636 . 
WIAF 1 WIAF-247 
WIAF 1 WIAF-637 
WIAF i WIAF-1823 
WIAF I WIAF-961 
WIAF I WIAF-806 
WIAF I WIAF-509 
WIAF I WIAF-644 
HU-CHINA I 7-1773 
HU-CHINA I 7-1773-2 
WIAF I WIAF-104 
WIAF I WIAF-1 772 
WIAF I WIAF-1 773 
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10 



15 



20 



25 



30 



35 



40 



45 



7 
7 
7 
7 
7 
7 
7 
7 
7 
7 
7 
7 
7 
7 
7 
7 
7 
7 
7 
7 
7 
7 
7 
7 
7 



7 
7 



479.2 
491.0 
49T.0 
493.0 
493.0 
495.4 
495.4 
495.4 
497.2 
502.4 
514.6 
522.1 
530.0 
568.6 
568.6 
598.6 
602.0 
602.0 
603.1 
603.1 
621.5 
646.3 
663.1 
663.1 
668.3 
669.9 
670.2 
670.6 



cR from 
cR from 
cR from 
cR from 
cR from 
cR from 
cR from 
cR from 
cR from 
cR from 
cR from 
cR from 
cR from 
cR from 
cR from 
cR from 
cR from 
cR from 
cR from 
cR from 
cR from 
cR from 
cR from 
cR from 
cR from 
cR from 
cR from 
cR from 



top o 
top o 
top o 
top o 
top o 
top a 
top 0 
top 0 
top o 
top o 
top o 
top o 
top o 
top 0 
top 0 
top o 
top o 
top o 
top o 
top o 
top o 
top 0 
top o 
top o 
top o 
top o 
top o 
top o 



Chr7 li 
Chr7 li 
Chr7 li 
Chr7 li 
Chr7 li 
Chr7 li 
Chr7 li 
Chr7 li 
Chr7 li 
Chr7 li 
Chr7 li 
Chr7 li 
Chr7 li 
Chr7 li 
Chr7 li 
Chr7 li 
Chr7 li 
Chr7 li 
Chr7 li 
Chr7 11 
Chr7 li 
Chr7 lii 
Chr7 link 
Chr7 link 
Chr7 link 
Chr7 link 
Chr7 link 
Chr7 link 



nk 
nk 
nk 
nk 
nk 
nk 
nk 
nk 
nk 
Ink 
nk 
nk 
nk 
nk 
nk 
nk 
nk 
nk 
nk 
nk 
nk 
nk 



2187 
4565 
3328 
2354 
2355 

867 
1265 
1058 
2548 
2909 
2487 
3329 
2866 
2678 

853 
3338 
2534 
2535 
3284 
3285 

730 
3154 
2444 
2445 
3332 
3536 
2751 
3522 

333 
3954 
3955 
3956 
4247 
4250 
4251 
4144 
4084 
4090 
4100 
3973 
3195 
1132 
2559 
1264 
1688 
2425 
2840 
3082 



WIAF I WIAF-570 

HU-CHINA I 7-1781 

WIAF I WIAF-1781 

WIAF I WIAF-2594 

WIAF 1 WIAF-2595 

WIAF 1 WIAF-1080 

WIAF I WIAF-2152 

WIAF- 1 WIAF-4174 

WIAF I WIAF-169 

WIAF I WIAF-807 

WIAF I WIAF-85 

WIAF I WlAF-1782 

WIAF I WIAF-764 

WIAF I WIAF-360 

WIAF I WIAF-361 

WIAF j WlAF-1791 

WIAF I WIAF-147 

WIAF I WIAF-148 

WIAF I WIAF-1737 

WIAF I WIAF-1738 

WIAF I WlAF-1087 

WIAF I WIAF-1055 * 

WIAF I WIAF-32 

WIAF I WIAF-33 

WIAF I WIAF-1785 

WIAF I WIAF-1991 

WIAF I WIAF-473 

WIAF I WIAF-1977 

EXAMPLE I CTFR-tttdel 

MARSHFIELD | MlD-1 

MARSHFIELD | MID-2 

MARSHFIELD | MlD-3 

MARSHFIELD | MlD-4 

MARSHFIELD j MID-5 • 

MARSHFIELD | MID-6 

SHGC/AFFYMETRIX | SNP-SHGC-1 3664 

SHGC/AFFYMETRIX j SNP-SHGC-1 6934 

SHGC/AFFYMETRIX j SNP-SHGC-1 71 67 

SHGC/AFFYMETRIX j SNP-SHGC-1 9036 

SHGC/AFFYMETRIX | SNP-SHGC-32515 



WIAF 
WIAF 
WIAF 
WIAF 
WIAF 
WIAF 
WIAF 
WIAF 



WIAF-1530 

WlAF-1579 

WlAF-183 

WIAF-2151 

WIAF-3627 

WIAF-5 

WIAF-678 

WIAF-983 
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8 0.1 cR from top of Chr8 llnkag 2267 WIAF 

8 0.1 cR from top of Chr8 llnkag 2268 WIAF 

8 0.70 cR from top of Chr8 linka 1 785 WIAF 

8 6.50 cR from top of Chr8 llnka 3442 WIAF 

5 8 6.50 cR from top of Chr8 llnka 3443 WIAF 

8 8.20 cR from top of Chr8 linka 1895 WIAF 

8 1 1 .1 cR from top of Chr8 linka 3840 WIAF 

8 1 3.40 cR from top of ChrB link 1281 WIAF 

8 1 3.40 cR from top of Chr8 link 1 282 WIAF 

10 8 1 5.50 cR from top of Chr8 link 1418 WIAF 

8 15.50 cR from top of Chr8 link 1419 WIAF 

8 20.40 cR from top of Chr8 link 3537 WIAF 

8 20.40 cR from top of Chr8 link 3538 WIAF 

8 22.7 cR from top of Chr8 linka 2624 WIAF 

1 5 8 30.70 cR from top of Chr8 link 2053 WIAF 

8 31.9 cR from top of Chr8 linka 3476 WIAF 

8 33.2 cR from top of Chr8 linka 2613 WIAF 

8 33.2 cR from top of Chr8 linka 2614 WIAF 

8 37.0 cR from top of Chr8 linka 2476 WIAF 

20 8 39.90 cR from top of Chr8 link 1233 WIAF 

8 40.90 cR from top of ChrB link 3218 WIAF 

8 42.70 cR from top of Chr8 link 1 100 WIAF 

8 43.70 cR from top of Chr8 link 1 149 WIAF 

8 43.90 cR from top of Chr8 link 1894 WIAF 

25 8 44.40 cR from top of Chr8 link 1481 WIAF 

8 47.90 cR from top of ChrB link 1857 WIAF 

8 55.4 cR from top of Chr8 linka 2457 WIAF 

8 55.4 cR from top of ChrB linka 2458 WIAF 

8 60.1 cR from top of ChrB linka 1040 WIAF 

30 8 62.60 cR from top of ChrB link 1 174 WIAF 

8 62.70 cR from top of ChrB link 3275 WIAF 

8 62.80 cR from top of ChrB link 1870 WIAF 

. 8 63.30 cR from top of ChrB link 1682 WIAF 

8 68.70 cR from top of ChrB link 1843 WIAF 

35 8 80.90 cR from top of Chr8 link 2024 WIAF 

8 81 .50 cR from top of Chr8 link 1 1 33 WIAF 

8 81.50 cRfromtopof ChrB link 3302 WIAF 

8 88.30 cR from top of ChrB link 1 197 WIAF 

8 95.3 cR from top of ChrB linka 2674 WIAF 

40 8 95.3 cR from top of ChrB linka 2675 WIAF 

8 96.20 cR from top of ChrB link 1971 WIAF 

8 96.3 cR from top of ChrB linka 3092 WIAF 

8 98.0 cR from top of ChrB linka 2920 WIAF 

B 101 .00 cR from top of ChrB iin 708 WIAF 

45 8 101.3 cR from top of ChrB link 2967 WIAF 

8 103.7 cR from top of ChrB link 2933 WIAF 

8 105.80 cRfromtopof ChrB Iin 1205 WIAF 

B 1 08.90 cR from top of ChrB Iin 1739 WIAF 

B 1 09.00 cR from top of ChrB Iin 1 1 63 WIAF 

50 8 109.80 cR from top of ChrB Iin 1713 WIAF 



WIAF- 
WIAF 
WIAF 
WIAF 
WIAF 
WIAF- 
WIAF- 
WIAF 
WIAF 
WIAF- 
WIAF- 
WIAF- 
WIAF 
WIAF- 
WIAF- 
WIAF- 
WIAF- 
WIAF- 
WIAF- 
WIAF- 
WIAF. 
WIAF- 
WIAF' 
WIAF- 
WIAF- 
WIAF 
WIAF- 
WIAF- 
WIAF- 
WIAF- 
WIAF- 
WIAF- 
WIAF- 
WIAF- 
WIAF- 
WIAF- 
WIAF- 
WIAF- 
WIAF- 
WIAF- 
WIAF- 
WIAF- 
WIAF- 
WIAF- 
WIAF- 
WIAF- 
WIAF- 
WIAF- 
WIAF- 
WIAF- 



•724 
-725 
•3724 
-1897 
•1898 
•3834 
2614 
•12168 
-2169 
-3345 
•3346 
■1992 
1993 
■283 
•1709 
1931 
■259 
-260 
72 

•2120 
•1596 
1517 
■1622 
3833 
•3420 
■3796 
48 
•49 
•4128 
-1693 
■1728 
3809 
■3621 
3782 
1565 
1580 
1755 
2084 
356 
357 
3910 
993 
819 
1406 
867 
832 
2092 
3678. 
1666 
3652 
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8 1 1 3.70 cR from top of Chr8 lin 2029 WIAF | WIAF-1 584- 

8 1 1 3.70 cR from top of Chr8 lin 1 336 WIAF | WIAF-2226 

8 115.8 cR from top of Chr8 link 2262 WIAF \ WIAF-716 

8 11 5,8 cR from top of Chr8 link 2263 WIAF j WIAF-71 7 

5 " 8 1 18.30 cR from top of ChrS lin 1350 WIAF | WIAF-3253 

8 11 8.30 cR from top of Chr8 lin 1 352 WIAF j WIAF-3255 

8 1 18.8 cR from top of Chr8 link 2722 WIAF i WIAF-426 

8 1 18.8 cR from top of ChrS link 2723 WIAF 1 WlAF-427 

8 1 19.00 cR from top of Chr8 lin 1447 WIAF j WIAF-3377 

1 0 8 1 19.00 cR from top of Chr8 lin 1448 WIAF j WIAF-3378 

8 1 21 .4 cR from top of Chr8 link 3629 WIAF | WIAF-2357 

8 124.1 cR from top of Chr8 link 2686 WIAF | WIAF-369 

8 124.1 cR from top of ChrS link 2687 WIAF 1 WIAF-370 

8 126.30 cR from top of ChrS lin 1863 WIAF j WIAF-3802 

15 8 1 26.30 cR from top of ChrS lin. 1 864 WIAF | WIAF-3803 

8 126.50 cR from top of ChrS lin 1213 WIAF | WIAF-2100 

8 1 26.50 cR from top of ChrS lin 1 234 WIAF | WIAF-21 21 

8 1 26.60 cR from top of ChrS lin 1326 WIAF | WIAF-2216 

8 1 32,00 cR from top of ChrS lin 1 672 WIAF j WIAF-361 1 

20 8 1 33.90 cR from top of ChrS lin 1742 WIAF j WIAF-3681 

8 147.10 cR from top of ChrS lin 3192 WIAF ] WIAF-1 516 

8 164.20 cR from top of ChrS lin 3128 WIAF j WlAF-1029 

8 164,70 cR from top of ChrS lin 3546 WIAF j WIAF-2001 

8 166.40 cR from top of ChrS lin 3240 WIAF j WIAF-1 650 

25 8 166.40 cR from top of ChrS lin 3241 WIAF | WIAF-1 651 

S 166.40 oR from top of ChrS lin 1847 WIAF j WIAF-37S6 

8 166.40 cR from top of ChrS lin 3009 WIAF j WIAF-909 

8 233.1 cR from top of ChrS link 3639 WIAF | WIAF-2367 

8 410.4 cR from top of ChrS link 3028 WIAF | WIAF-928 

30 8 416.4 cR from top of ChrS link 2474 WIAF | WIAF-69 

8 416.4 cR from top of ChrS link 2475 WIAF j WIAF-70 

8 435.6 cR from top of ChrS link 2133 WIAF j WIAF-354 

8 435.6 cR from top of ChrS link 2134 WIAF | WIAF-355 

8 441 .8 cR from top of ChrS link 3430 WIAF j WIAF-1 885 

35 8 466.7 cR from top of ChrS link 3391 WIAF | WIAF-1 846 

8 514.9 cR from top of ChrS link 2497 WIAF j WIAF-96 

8 541 .5 cR from top of ChrS link 3392 WIAF | WIAF-1 847 

8 579.6 cR from top of ChrS link 3873 WIAF | WlAF-2672 

S 5aS.3 cR from top of ChrS link 341 3 WIAF j WIAF-1 868 

40 8 591.7 cR from top of ChrS link 3526 WIAF j WIAF-1 981 

8 592.1 cR from top of ChrS link 2157 WIAF j WIAF-490 

8 592.1 cR from top of ChrS link 829 WIAF j WIAF-491 

8 592.4 cR from top of ChrS link 762 WIAF 1 WIAF-1 351 

8 625.1 cR from top of ChrS link 2459 WIAF | WIAF-50 

45 8 625.4 cR from top of ChrS link 2664 WIAF j WlAF-341 

8 628.6 cR from top of ChrS link 2202 WIAF j WIAF-61 5 

8 653.0 cR from top of ChrS link 2864 WIAF | WIAF-761 

8 656.3 cR from top of ChrS link 934 WIAF | WIAF-1 291 

8 659.2 cR from top of ChrS link 3088 WIAF j WIAF-9S9 

50 8 670.0 cR from top of ChrS link 2807 WIAF | WrAF-583 
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Chr8 link 
Chr8 link 
Chr8 link 
ChrS link 
ChrS link 
ChrS link 



10 



15 



20 



25 



30 



35 



40 



45 



50 



8 681.6 cR from top of 

8 689.6 cR from top of 

8 708.8 cR fronn top of 

8 717.8 cR from top of 

8 731.6 cR from top of 

8 791.3 cR from top of 

8 

8 

8 

8 

8 

8 

8 

8 

8 

8 

8 

8 

8 

8 

8 

8 

8 

8 

8 

8 



9 0.00 cR from top of Chr9 linka 

9 0.00 cR from top of Chr9 linka 

9 7.40 cR from top of Chr9 linka 

9 12.10 cR from top of Chr9 link 

9 13.00 cR from top of Chr9 link 

9 13.00 cR from top of Chr9 link 

9 1 5.60 cR from top of Chr9 link 

9 15.60 cR from top of Chr9 link 

9 19.60 cR from top of Chr9 link 

9 25,00 cR from top of ehr9 link 

9 28.90 cR from top of Chr9 link 

9 29.5 cR from top of Chr9 linka 

9 30.0 cR from top of Chr9 linka 

9 30.0 cR from top of Chr9 linka 

9 44.00 cR from top of Chr9 link 

9 55.6 cR from top of Chr9. linka 

9 55.6 cR from top of Chr9 linka 

9 57.40 cR from top of Chr9 link 

9 57.50 cR from top of Chr9 link 

9 57.80 cR from top of Chr9 link 

9 62.0 cR from top of Chr9 linka 

9 62.0 cR from top of Chr9 linka 



2839 
775 
3126 
2504 
2239 
742 
4220 
4000 
4140 
4065 
3984 
3143 
3171 
3292 
3347 
3447 
3700 
3779 
1634 
3890 
3908 
2742 
2743 
2488 
2965 
3095 



1358 
1393 
1942 
1400 
1155 
1156 
2027 
1661 
1279 
688 
1749 
2105 
2777 
3056 
3464 
3298 
3299 
1283 
1403 
2556 
4575 
2970 



WIAF I WIAF-677 
WIAF I WIAF-1410 
WIAF I WIAF-1027 
•WIAF I WIAF-108 
WIAF I WIAF-676 
WIAF I WIAF-1186 
MARSHFIELD | MID-12 
SHGC/AFFYMETRIX | SNP-SHGC-1 2093 
SHGC/AFFYMETRIX | SNP-SHGC-1 31 26 
SHGC/AFFYMETRIX | SNP-SHGC-1 3448 
SHGC/AFFYMETRIX | SNP-SHGC-971 1 



WIAF 
WIAF 
WIAF 
WIAF 
WIAF 
WIAF 
WIAF 
WIAF 
WIAF 
WIAF 
WIAF 
WIAF 
WIAF 
WIAF 
WIAF 



WIAF- 1044 

WIAF-1482 

WIAF-1745 

WIAF- 1800 

WIAF-1902 

WIAF-2428 

WIAF-2507 

WIAF-3573 

WIAF-3951 

WIAF-3985 

WIAF-457 

WIAF-458 

WIAF-86 

WIAF-865 

WIAF-996 



WIAF I WIAF-3261 
WIAF I WIAF.3298 
WIAF I WIAF-3881 
WIAF I WIAF-3320 
WIAF I WIAF-1640 
WIAF [ WIAF- 1641 
WIAF I WIAF-1582 
WIAF 1 WIAF-3600 
WIAF I WIAF-2166 
WIAF I WIAF-1194 
WIAF 1 WIAF-3688 
WIAF 1 WIAF-185 
WIAF I WlAF-521 
WIAF I WIAF-957 
WIAF I WIAF-1919 
WIAF I WIAF-1751 
WIAF 1 WIAF-1752 
WIAF I WIAF-2170 
WIAF I WIAF-3323 
WIAF I WIAF-180 
HU-CHINA I 9-870 
WIAF I WIAF-870 
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9 62.70 cR from top of Chr9 link 3246 WIAF 

9 62.70 cR from top of ChrS link 3247 WIAF 

9 64.0 cR from top of Chr9 linka 2272 WIAF 

9 64.10 cR from top of Chr9 link 2043 WIAF 

5 9 65.20 cR from top of ChrS link 1148 WIAF 

9 67.40 cR from top of Chr9 link 1318 WIAF 

9 67.40 cR from top of Chr9 link 1319 WIAF 

9 68.80 cR from top of ChrS link 1874 WIAF 

9 68.9 cR from top of ChrS linka 932 WIAF 

1 0 9 69.70 cR from top of Chr9 link 3433 WIAF 

9 74.9 cR from top of Chr9 linka 3444 WIAF 

9 74.9 cR from top of ChrS linka 3445 WIAF 

S 82.30 cR from top of ChrS link 1246 WIAF 

9 83.90 cR from top of Chr9 link 1 278 WIAF 

15 9 84.50 cR from top of Chr9 link 3482 WIAF 

S 84.60 cR from top of ChrS link 1 579 WIAF 

9 84.60 cR from top of ChrS link 1581 WIAF 

9 86.80 cR from top of ChrS link 1192 WIAF 

9 95.90 cR from top of Chr9 link 1892 WIAF 

20 9 100,60 cR from top of Chr9 lin 331 1 WIAF 

9 lOO.eOcRfromtopof ChrS lin 1405 WIAF 

9 105.20 cR from top of Chr9 lin 1615 WIAF 

9 105.80 cR from top of Chr9 lin 1237 WIAF 

9 105.80 cR from top of Chr9 lin 1238 WIAF 

25 9 106,20 cR from top of Chr9 lin 686 WIAF 

9 106.20 cR from top of ChrS lin 687 WIAF 

9 109.3 cR from top of Chr9 link 2237 WIAF 

9 109.3cRfromtopofChr9 link 2238 WIAF 

S 118.20 cR frdm top of ChrS lin 2881 WIAF 

30 S 122.20 cR from top of ChrS lin 1536 WIAF 

S 122.20cRfromtopof ChrS lin 1538 WIAF 

9 123.20 cR from top of Chr9 lin 1430 WIAF 

9 1 24.00 cR from top of Chr9 lin 1268 WIAF 

9 1 24.00 cR from top of ChrS lin 126S WIAF 

35 9 129.0cRfromtopofChr9 link 3830 WIAF 

9 132.60 cR from top of Chr9 lin 1999 WIAF 

9 136.30 cR from top of Chr9 lin 1154 WIAF 

9 1 37.00 cR from top of ChrS lin 1533 WIAF 

S 1 37.70 cR from top of ChrS lin 1301 WIAF 

40 S 137.70 cR from top of ChrS lin 1302 WIAF 

S 138.0 cR from top of ChrS link 2222 WIAF 

S 142.10 cR from top of ChrS lin 1640 WIAF 

S 142.70 cR from top of ChrS lin 1331 WIAF 

S 142.70 cR from top of ChrS lin 1S2S WIAF 

45 9 143.50cRfromtopofChr9 lin 1207 WIAF 

9 143.50 cR from top of Chr9 lin 1208 WIAF 

9 143.90 cR from top of ChrS lin 2558 WIAF 

9 144.60 cR from top of Chr9 lin 1797 WIAF 

9 148.7 cR from top of Chr9 link 891 WIAF 

50 S 1 48.7 cR from top of ChrS link 1042 WIAF 



WIAF-1657 

WIAF-1658 

WIAF-731 

WIAF-1634 

WIAF-161S 

WIAF-2208 

WIAF-220S 

WIAF-3813 

WlAF-1 280 

WlAF-1888 

WIAF-1899 

WIAF-1S00 

WIAF-2133 

WIAF-2165 

WIAF-1S37 

WIAF-3518 

WIAF-3520 

WIAF-207S 

WIAF-3831 

WIAF-1764 

WIAF-3325 

WIAF-3554 

WIAF-2124 

WlAF-2125 

WIAF-1177 

WIAF-1178 

WIAF-674 

WIAF-675 

WIAF-77S 

WIAF-3475 

WIAF-3477 

WIAF-3358 

WIAF-2155 

WIAF-2156 

WIAF-25SS 

WIAF-3938 

WIAF-1638 

WIAF-3472 

WIAF-21S0 

WIAF-21S1 

WIAF-64S 

WIAF-3579 

WIAF-2221 

WIAF-3868 

WIAF-2094 

WIAF-2095 

WIAF-182 

WIAF-3736 

WIAF-1136 

WIAF-4131 
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10 



15 



20 



25 



30 



35 



40 



45 



50 



9 
9 
9 
9 
9 
9 



9 
9 
9 
9 
9 
9 
9 
9 
9 
9 
9 



9 
9 
9 
9 
9 
9 
9 
9 
9 
9 



164. 
166. 
210. 
264. 
293. 
326. 



9 328, 
9 328 



336 
342 
345 
367 
374 
389 
409 
437 
447 
449 
450 



9 480 
9 483 



493 
511 
512 
515 
516 
518 
523 
526 
526 
526 



9 526 

9 

9 

9 

9 

9 

9 

9 

9 

9 

9 

9 

9 

9 

9 

9 

9 



70 cR from 
50 cR from 

3 cR from 

4 cR from 

7 cR from 
9 cR from 

5 cR from 

5 cR from 
4 cR from 

6 cR from 
4 cR from 

4 cR from 

1 cR from 

8 cR from 
6 cR from 

9 cR from 
0 cR from 
9 cR from 
0 cR from 

5 cR from 
4 cR from 

6 cR from 

3 cR from 

4 cR from 
6 cR from 
3 oR from 
9 cR from 

2 cR from 

5 cR from 
5 cR from 
5 cR from 
5 cR from 



top of Chr9 
top of Chr9 



op o 
op o 
op o 
op o 
op o 
op o 
op o 
op o 
op o 
op o 
op o 
op o 
op o 
op 0 
op o 
op o 
op o 
op o 
op o 
op o 
op o 
op 0 
op o 
op o 
:op o 
:op o 
:op o 
:op o 
:op o 
;op o 



Chr9 I 
Chr9 I 
Chr9 I 
Chr9 I 
Chr9 I 
Chr91 
Chr9 I 
Chr9 I 
Chr9 I 
Chr9 li 
Chr9 li 
Chr9 li 
Chr9 !i 
Chr9 li 
Chr9 Ii 
Chr9 li 
Chr9 I 
Chr9 I 
Chr9 li 
Chr9 I 
Chr9 I 
Chr9 I 
Chr9 I 
Chr9 I 
Chr9 I 
Chr9 I 
Chr9 I 
Chr9l 
Chr9I 
Chr9 I 



lin 
lin 
nk 
nk 
nk 
nk 
nk 
nk 
nk 
nk 
nk 
nk 
nk 
nk 
nk 
nk 
nk 
nk 
nk 
nk 
nk 
nk 
nk 
nk 
ink 
ink 
ink 
nk 
nk 
nk 
nk 
nk 



1994 
1829 
3074 
786 
2245 
2621 
2642 
2725 
2439 
2341 
3059 
2899 
2453 
2845 
2145 
2806 
3050 
3005 
2601 
2446 
2566 
2547 
3468 
2521 
2524 
2102 
3816 
2489 
998 
999 
1000 
1001 
4226 
4227 
3995 
4007 
4071 
4075 
4079 
4014 
3150 
3210 
3580 
3581 
1588 
1862 
2465 
2466 
2477 
3087 



WIAF I WIAF-3933 
WIAF I WIAF-3768 
WIAF I WIAF-975 
WIAF I WIAF-2049 
WIAF 1 WIAF-689 
WIAF I WIAF-277 
WIAF 1 WIAF-309 
WIAF 1 WIAF-430 
WIAF I WIAF-24 
WIAF I WIAF-2567 
WIAF I WIAF-960 
WIAF I WIAF-797 
WIAF 1 WIAF-41 
WIAF 1 WIAF-691 
WIAF 1 WIAF-433 
WIAF I WIAF-578 
WIAF I WIAF-951 
WIAF I WIAF-905 
WIAF I WIAF-245 
WIAF I WIAF-34 
WIAF I WIAF-194 
WIAF 1 WIAF-168 
WIAF I WIAF-1923 
WIAF I WIAF-130 
WIAF I WlAF-133 
WIAF I WIAF-166 
WIAF I WIAF-2570 
WIAF I WIAF-87 
WIAF I WIAF-2346 
WIAF I WIAF-2349 
WIAF I WIAF-2353 
WIAF I WIAF-2355 
MARSHFIELD | MID- 
MARSHFIELD | MID- 
SHGC/AFFYMETRIX 
SHGC/AFFYMETRIX 
SHGC/AFFYMEtRIX 
SHGC/AFFYMETRIX 
SHGC/AFFYMETRIX 
SHGC/AFFYMETRIX 



18 
19 

I SNP-SHGC-10262 
I SNP-SHGC-1334 
] SNP-SHGC-14625 
I SNP-SHGC-15679 
I SNP-SHGC-16528 
I SNP-SHGC-3934 



WIAF 
WIAF 
WIAF 
WIAF 
WIAF 
WIAF 
WIAF 
WIAF 
WIAF 
WIAF 



WIAF-1051 

WIAF-1563 

WIAF-2035 

WIAF-2036 

WIAF-3527 

WIAF-3801 

WIAF-58 

WIAF-59 

WIAF-73 

WIAF-988 
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10 -6cM 

10 6.10 cR from top of 

10 17.30 cR from top o 

5 10 1 7.30 cR from top o 

10 19.70 cR from top a 

10 22.20 cR from top o 

10 28.50 cR from top o 

10 28.50 cR from top o 

10 10 29.00 cR "from top o 

10 29.00 cR from top a 

10 31.40 cR from top o 

10 31.80 cR from top o 

10 32.00 cR from top a 

15 10 36.30 cR from top a 

10 41.10 cR from top a 

10 43.3 cR from top of 

10 43.80 cR from top oi 

10 44.90 cR from top o1 

20 10 45.10 cR from top 01 

10 45.60 cR from top oi 

10 45.50 cR from top o 

10 52.00 cR from top o 

10 61.60 cR from top o" 

25 10 67.90 cR from top o 

10 79.40 cR from top oi 

10 80.20 cR from top o' 

10 83.30 cR from top o 

10 85.60 cR from top o 

30 1 0 89.40 cR from top o 

10 96.30 cR from top o 

10 96.90 cR from top o 

1 0 96.90 cR from top o 

10 96.90 cR from top o 

35 10 97.40 cR from top o 

10 97.60 cR from top o 

10 97.60 cR from top o 

10 105.3 cR from top o 

10 106.70 cR from top 

40 10 107.90 cR from top 

10 110.30 cR from top 

10 110.50 cR from top 

10 112.50 cR from top 

10 112.70 cR from top 

45 10 112.70 cR from top 

10 113.50 cR from top 

10 122.50 cR from top 

10 123 cM 

10 123.00 cR from top 

50 10 130.50 cR from top 



ChrlO link 
ChrlOl 
ChrlO I 
Chr10 I 
ChrlOl 
ChrlO I 
ChrlO I 
ChrlO I 
ChrlO li 
ChrlOl 
ChrlOl 
ChrlOl 
ChrlOl 
ChrlOl 
ChrlO link 
ChrlO lln 
ChrlO I 
ChrlO I 
ChrlO I 
ChrlO I 
ChrlOl 
ChrlO I 
ChrlOl 
ChrlOl 
ChrlOl 
ChrlOl 
ChrlO li 
ChrlOl 
ChrlO lln 
ChrlO li 
ChrlO I 
ChrlOl 
ChrlO I 
ChrlO li 
ChrlO I 
ChrlOl 
of ChrlOl 
of ChrlOl 
of ChrlO I 
of ChrlOl 
of ChrlO I 
of ChrlO I 
of ChrlO I 
of ChrlO I 
of ChrlO 11 

of ChrlO 11 
of ChrlO II 



4315 


UWGC 


1 134 


1310 


WIAF 1 


WIAF-2199 


1598 


WIAF 1 


WIAF-.3537 


1600 


WIAF 1 


WIAF-3539 


1271 


WIAF 1 


WIAF-2158 


2538 


WIAF 1 


WIAF-152 


3181 


WIAF 1 


WIAF-1497 


3182 


WIAF 


WIAF-1498 


4576 


HU-CHINA 1 10-1729 


3276 


WIAF 1 


WIAF-1729 


2016 


WIAF 1 


WlAF-1511 


1603 


WIAF 1 


WIAF-3542 


3265 


WIAF 1 


WIAF-1704 


2680 


WIAF 1 


WIAF-363 


3268 


WIAF 1 


WIAF-1713 


3289 


WIAF 1 


WIAF- 1742 


1675 


* WIAF 1 


WIAF-3614 


1820 


WIAF 1 


WIAF-3759 


1969 


WIAF 1 


WIAF-3908 


1183 


WIAF 1 


WIAF-1715 


1184 


WIAF 1 


WIAF-1716 


1363 


WIAF 1 


WIAF-3266 


1178 


WIAF 1 


WIAF-1707 


1858 


WIAF 1 


WIAF-3797 


1962 


WIAF 1 


WIAF-3901 


3230 


WIAF 


WIAF-1632 


1960 


WIAF 


WIAF-3899 


1772 


WIAF 


WIAF-371 1 


3517 


WIAF 1 


WIAF-1972 


1198 


WIAF 1 


WIAF-2085 


3213 


WIAF 1 


WIAF-1585 


1542 


WIAF 1 


WIAF-3481 


1651 


WIAF 1 


WIAF-3590 


1698 


WIAF 1 


WIAF-3637 


1554 


WIAF 1 


WIAF-3493 


1556 


WIAF 1 


WIAF-3495 


2428 


WIAF 1 


WIAF-9 


3187 


WIAF 1 


WIAF-1503 


1597 


WIAF 1 


WIAF-3536 


1499 


WIAF 1 


WIAF-3438 


3561 


WIAF 1 


WIAF-201 6 


1390 


WIAF 1 


WIAF-3295 


1413 


WIAF 1 


WIAF-3334 


1414 


WIAF 1 


WIAF-3335 


1752 


WIAF 1 


WIAF-3691 


1462 


WIAF 1 


WIAF-3396 


4321 


UWGC 


1 140 


1703 


WIAF 1 


WIAF-3642 


1762 


WIAF 1 


WIAF-3701 
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10 



15 



20 



25 



30 



35 



40 



45 



50 



10 130, 

10 133, 

10 134, 

10 134, 

10 138, 

10 146 

10 146 

10 150. 

10 155 

10 180. 

10 185. 

10 293, 

10 306, 

10 343. 

10 356, 

10 359. 

10 366, 

10 382, 

10 384, 

10 389. 

10 425, 

10 431 

10 433 

10 437 

10 440 

10 442 

10 467 

10 467 

10 496 

10 505 

10 505 

10 515 

10 515 

10 537 

10 542 

10 551 

10 557 

10 558 

10 567 

10 598 

10 620 

10 623 

10 646 
10 
10 
10 
10 
10 
10 
10 



50 cR from top of ChrlO II 

6 cR from top of ChrlO lin 
80 cR from top of ChrlO I 
80 cR from top of ChrlO I 
90 cR from top of ChrlO li 
60 cR from top of ChrlO li 
80 cR from top of Chrl 0 li 
50 cR from top of ChrlO I 
30 cR from top of ChrlO I 
3 cR from top of ChrlO lin 

3 cR from top of ChrlO lin 

4 cR from top of ChrlO lin 

5 cR from top of ChrlO lin 

7 cR from top of ChrlO lin 

8 cR from top of ChrlO lin 
1 cR from top of Chrl 0 lin 

6 cR from top of ChrlO lin 

1 cR from top of ChrlO lin 

4 cR from top of ChrlO lin 

5 cR from top of ChrlO lin 

7 cR from top of ChrlO lin 
3 cR from top of ChrlO lin 

0 cR from top of ChrlO lin 

2 cR from top of ChrlO lin 

2 cR from top of ChrlO lin 

3 cR from top of ChrlO lin 

6 cR from top of ChrlO lin 
6 cR from top of ChrlO lin 
3 cR from top of ChrlO lin 

8 cR from top of ChrlO lin 
8 cR from top of ChrlO lin 

2 cR from top of ChrlO lin 
,7 cR from top of ChrlO lin 
,8 cR from top of ChrlO lin 
.2 cR from top of ChrlO lin 
.7 cR from top of ChrlO lin 
.3 cR from top of ChrlO lin 

3 cR from top of ChrlO lin 
5 cR from top of ChrlO lin 

4 cR from top of ChrlO lin 

5 cR from top of ChrlO lin 
8 cR from top of ChrlO lin 

1 cR from top of ChrlO lin 



1764 WIAF I WIAF-3703 

3035 WIAF I WIAF-935 

2045 WIAF I WIAF-1661 

1389 WIAF I WIAF-3294 

1165 WIAF I WIAF-1676 

1394 WIAF I WIAF-3299 

1822 WIAF I WIAF-3761 

3412 WIAF I WIAF-1867 

3337 WIAF j WIAF-1790 

3065 WIAF j WIAF-966 

2938 WIAF I WIAF-837 

798 WIAF I WIAF-4055 

2552 WIAF I WIAF'175 

939 WIAF I WIAF-1297 

2199 WIAF 1 WIAF-609 

3529 WIAF I WIAF-1984 

3141 WIAF I WIAF-1042 

2127 WIAF I WIAF-303 
2247 WIAF I WIAF-692 
2536 WIAF I WIAF-149 
2912 WIAF I WIAF-811 
2641 WIAF I WIAF-308 
2081 WIAF I WlAF-31 

2128 WIAF I WIAF-310 
3346 WIAF I WIAF-1799 
2277 WIAF I WIAF-744 

3434 wiAf= I WIAF-1889 

3435 WIAF I WIAF-1890 
2991 WIAF I WIAF-891 
2351 WIAF I WIAF-2588 
2353 WIAF 1 WIAF-2593 
2882 WIAF I WIAF-780 
3334 WIAF I WIAF-1787 
3416 WIAF I WIAF-1871 
3037 WIAF I WIAF-937 
3102 WIAF I WIAF-1003 
3506 WIAF I WIAF-1961 
3155 WIAF I WIAF-1056 
3853 WIAF I WIAF-2640 
2620 WIAF 1 WIAF-271 
3558 WIAF I WIAF-2013 
2879 WIAF I WIAF-777 

894 WIAF 1 WIAF-1142 

4153 SHGC/AFFYMETRIX 

4159 SHGC/AFFYMETRIX 

4076 SHGC/AFFYMETRIX 

4166 SHGC/AFFYMETRIX 

4103 SHGC/AFFYMETRIX 

4104 SHGC/AFFYMETRIX 
3976 SHGC/AFFYMETRIX 



I SNP-SHGC- 
I SNP-SHGC- 
I SNP-SHGC- 
1 SNP-SHGC- 
I SNP-SHGC- 
I SNP-SHGC- 
I SNP-SHGO 



14257 
14726 
15732 
23692 
30908 
31374 
35401 
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10 



.15 



20 



25 



30 



35 • 



40 



45 



50 



3.10 cR from top of Chrl 1 
3.10 cR from top of Chrl 1 
3.90 cR from top of Chrl 1 
3.90 cR from top of Chrl 1 
4.60 cR from top of Chrl 1 
1 5.50 cR from top of Chr1 
15,60 cR from top of Chrl 
1 5.60 cR from top of Chrl 
16.80 cR from top of Chr1 
1 8.50 cR from top of Chrl 
18.50 cR from top of Chrl 
23,50 cR from top of Chr1 
23.60 cR from top of Chrl 
23.70 cR from top of Chrl 
24.50 cR from top of Chr1 
26.50 cR from top of Chrl 
30.7 cR from top of Chrl 1 

34.9 cR from top of Chrl 1 

38.00 cR from top of Chrl 
38.3 cR from top of Chrl 1 
39.80 cR from top of Chrl 
43.7 cR from top of Chrl 1 
45.7 cR from top of Chrl 1 

52.1 cR from top of Chrl 1 

52.2 cR from top of Chr1 1 
58.7 cR from top of Chrl 1 
62.20 cR from top of Chrl 
62.30 cR from top of Chrl 
62.50 cR from top of Chrl 
62.50 cR from top of Chrl 
63.20 cR from top of Chrl 
63.20 cR from top of Chrl 
67.00 cR from top of Chrl 
68.0 cR from top of Chrl 1 
68.30 cR from top of Chr1 
68.30 cR from top of Chrl 
74.30 cR from top of Chr1 
76.50 cR from top of Chrl 
76.50 cR from top of Chrl 
76.60 cR from top of Chr1 

77.10 cR from top of Chrl 
77.50 cR from top of Chrl 



link 
link 
link 
link 
link 
I 



1 
1 
1 
1 li 
1 li 
1 I 
1 li 
1 li 
1 li 
1 I 
1 li 
link 
link 
1 lin 
link 
1 lin 
link 
link 
link 
link 
link 
1 lin 
1 lin 
1 lin 
1 lin 
1 lin 
1 lin 
1 lin 
link 
1 lin 
1 lin 
1 lin 
1 lin 
1 lin 
1 lin 
1 lin 
1 lin 



3724 
1530 
1691 
3882 
2483 
3001 
3079 

1544 
1683 

685 

694 
1123 
1900 

665 
1710 
1392 
1568 
1571 

699 
2987 
1580 
1355 
1553 
2615 

784 
3494 
2921 
1531 
3316 
3400 
2192 
2176 
3502 
1958 
1371 
1560 
1917 
3198 
3199 
1277 
2233 
2056 
1330 
3158 
1361 
1720 
1306 
1901 
2022 



WIAF 
WIAF 
WIAF 
WIAF 
WIAF 
WIAF 
WIAF 

WIAF 
WIAF 
WIAF 
WIAF 
WIAF 
WIAF 
WIAF 
WIAF 
WIAF 
WIAF 
WIAF 
WIAF 
WIAF 
WIAF 
WIAF 
WIAF 
WIAF 
WIAF 
WIAF 
WIAF 
WIAF 
WIAF 
WIAF 
WIAF 
WIAF 
WIAF 
WIAF 
WIAF 
WIAF 
WIAF 
WIAF 
WIAF 
WIAF 
WIAF 
WIAF 
WIAF 
WIAF 
WIAF 
WIAF 
WIAF 
WIAF 
WIAF 



WIAF- 
WIAF- 
WIAF- 
WIAF- 
WIAF- 
WIAF- 
WIAF- 

WIAF- 
WIAF- 
WIAF- 
WIAF- 
WIAF- 
WIAF- 
WIAF- 
WIAF- 
WIAF- 
WIAF- 
WIAF- 
WIAF- 
WIAF- 
WIAF- 
WIAF- 
WIAF- 
WIAF- 
WIAF- 
WIAF- 
WIAF- 
WIAF- 
WIAF- 
WIAF. 
WIAF- 
WIAF- 
WIAF- 
WIAF- 
WIAF- 
WIAF- 
WIAF- 
WIAF- 
WIAF- 
WIAF- 
WIAF- 
WIAF- 
WIAF" 
WIAF- 
WIAF- 
WIAF- 
WIAF- 
WIAF- 
WIAF- 



2452 

3469 

3630 

3943 

80 

901 

980 

3483 

3622 

1160 

1245 

1566 

3839 

1061 

3649 

3297 

3507 

3510 

1271 

887 

3519 

3258 

3492 

262 

2043 

1949 

820 

3470 

1769 

1855 

591 

551 

1957 

3897 

3274 

3499 

3856 

1535 

1536 

2164 

668 

1719 

2220 

1059 

3264 

3659 

2195 

3840 

1529 
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10 



15 



20 



25 



30 



35 



40 



45 



50 



80.10 cR from top of Chr11 lin 
81 ,90 cR from top of Chr1 1 lin 
82.90 cR from top of Chrl 1 lin 
83.20 cR from top of Chrl 1 lin 
84.20 cR from top of Chrl 1 lin 
86.90 cR from top of Chrl 1 lin 
89.8 cR from top of Chrl 1 link 
89.8 cR from top of Chrl 1 link 
93.30 cR from top of Chrl 1 lin 
94.10 cR from top pf Chr1 1 lin 
97.90 cR from top of Chrl 1 lin 
98.40 cR from top of Chrl 1 lin 
98.40 cR from top of Chr1 1 lin 
1 02.60 cR from top of Chrl 1 .1 
106.6 cR from top of Chrl 1 lin 
106.80 cR from top of Chrll I 
106.80 cR from top of Chrl 1 I 
107.90 cR from top of Chrl 1 I 
108.00 cR from top of Chrl 1 1 
108.10 cR from top of Chrll I 
112.50 oR from top of Chrll I 
113 cM 

1 16.20 cR from top of Chrl 1 I 
1 17.40 cR from top of Chrl 1 I 
1 1 8.40 cR from top of Chrl 1 1 
11 8.40 cR from top of Chrl 1 I 
1 20.00 cR from top of Chrl 1 I 
1 20.00 cR from top of Chrl 1 I 
1 20. 1 0 cR from top of Chrl 1 I 
1 20.70 cR from top of Chrl 1 li 

1 26.00 cR from top of Chrl 1 li 
1 26.60 cR from top of Chrl 1 I 
1 31 .70 cR from top of Chrl 1 I 
145.60 cR from top of Chrl 1 I 
149.80 cR from top of Chrl 1 I 

1 50.6 cR from top of Chrl 1 lin 

1 66.3 cR from top of Chrl 1 lin 
171.5 cR from top of Chrll lin 

305.1 cR from top of Chrl 1 lin 
314.5 cR from top of Chrl 1 lin 
323.0 cR from top of Chr1 1 lin 

323.0 cR from top of Chrl 1 lin 

344.4 cR from top of Chr1 1 lin 

359.1 cR from top of Chrl 1 lin 
359.1 cR from top of Chrl 1 lin 
359.1 cR from top of Chrl 1 lin 
376.1 cR from top of Chrl 1 lin 
377.9 cR from top of Chrl 1 lin 

379.5 cR from top of Chrl 1 lin 

387.7 cR from top of Chrl 1 lin 



1645 


WIAF 


1 WIAF-3584 


1202 


WIAF 


1 WIAF-2089 


2001 


WIAF 


WIAF-3940 


3091 


WIAF 


WIAF-992 


3170 


WIAF 


WlAF-1480 


3162 


WIAF 


WIAF-1465 


996 


WIAF 


WIAF-2045 


3585 


WIAF 


WIAF-2046 


1217 


WIAF 


i WIAF-2104 


3497 


WIAF 


j WIAF-1952 


1122 


WIAF 


WIAF-l'564 


1095 


WIAF 


WIAF-1483 


1096 


WIAF 


WIAF-1484 


3495 


WIAF 


WIAF-1950 


864 


WIAF 


WIAF-1075 


1868 


WIAF 


WIAF-3807 


1869 


WIAF 


WIAF-3808 


3202 


WIAF 


WIAF-1542 


2049 


WIAF 


WIAF-1688 


1478 


WIAF 


WIAF-3417 


1103 


WIAF 


WIAF-1525 


4311 


UWGC 


1 130 


1909 


WIAF 


WIAF-3848 


675 


WIAF 


WIAF- 1440 


1714 


WIAF 


WIAF-3653 


1715 


WIAF 


WIAF-3654 


1830 


WIAF 


WIAF-3769 


1831 


WIAF 


WIAF-3770 


1943 


WIAF 


WIAF-3882 


3258 


WIAF 


WIAF-1694 


1854 


WIAF 


WIAF-3793 


1386 


WIAF 


WIAF-3291 


1899 


WIAF 


WIAF-3838 


1806 


WIAF 


WIAF-3745 


3267 


WIAF 


WIAF-1712 


1089 


WIAF 


WIAF-3183 


2942 


WIAF 


WlAF-841 


3829 


WIAF 


WIAF-2598 


3831 


WIAF 


WIAF-2600 


2892 


WIAF 


WIAF-790 


2606 


WIAF 


WIAF-252 


2607 


WIAF 


WIAF-253 


3407 


WIAF 


WIAF-1862 


2530 


WIAF 


WIAF-141 


2531 


WIAF 


WIAF-142 


2532 


WIAF 


WIAF-143 


3411 


WIAF 


WIAF-1866 


2533 


WIAF 


WIAF-146 


2484 


WIAF 


WIAF-81 


2592 


WIAF 


WlAF-228 
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10 



15 



20 



25 



30 



35 



40 



45 



50 



387.7 
389.8 
392.6 
403.4 
419.0 
419.0 
421.1 
428.6 
432.6 
458.3 
466.7 
488.1 
506.0 
522.5 
573.0 
604.0 
604.0 
624.2 



cR 


from 


top 


of Chrl 1 


lin 


cR 


from 


top 


of Chrl 1 


lin 


cR 


from 


top 


of Chr1 1 


lin 


cR 


from 


top 


of Chrl 1 


iin 


cR 


from 


top 


of Chrll 


lin 


cR 


from 


top 


of Chr1 1 


lin 


cR 


from 


top 


of Chr1 1 


lin 


cR 


from 


top 


of Chrl 1 


lin 


cR 


from 


top 


of Chrl 1 


lin 


cR 


from 


top 


of Chrl 1 


lin 


cR 


from 


top 


of Chr11 


lin 


cR 


from 


top 


of Chrl 1 


lin 


cR 


from 


top 


of Chrl 1 


lin 


cR 


from 


top 


of Chrll 


lin 


cR 


from 


top 


of Chrl 1 


lin 


cR 


from 


top 


of Chr1 1 


lin 


cR 


from 


top 


of Chrl 1 


lin 


cR 


from 


top 


of Chrl 1 


Irn 



0.70 cR from top of Chrl 2 link 
10.70 cR from top of Chrl 2 lin 
1 3.80 cR from top of Chrl 2 lin 
1 5.1 0 cR from top of Chrl 2 lin 
19.60 oR from top of Chrl 2 lin 
20.40 cR from top of Chrl 2 lin 
28.30 cR from top of Chr12 lin 
28.40 cR from top of Chrl 2 lin 
29.30 cR from top of Chrl 2 lin 
31 .80 cR from top of Chrl 2 lln 
32 cM 

32-80 cR from top of Chrl 2 lln 



2593 
2857 
2943 
3815 
2590 
2591 
3371 
3069 
2103 
3478 
3608 
3134 
1086 
2485 
2736 
2727 
3070 
3379 
4228 
3959 
3965 
4064 
4073 
4093 
4013 
4046 
2429 
1090 
1134 
3214 
3259 
3313 
3314 
3543 
1638 
1666 
3090 

1662 
3553 
3353 
1573 
3352 
674 
1967 
2862 
1625 
1848 
4322 
2979 



WIAF I WIAF-229 
WIAF I WIAF-750 
WIAF I WIAF-842 
WIAF t WIAF-2569 
WIAF I WIAF-226 
WIAF I WiAF-227 
WIAF I WIAF-1824 
WIAF I WIAF-970 
WIAF I WIAF- 174 
WIAF 1 WIAF-1933 
WIAF I WIAF-2322 
WIAF j WIAF-1035 
WIAF j WIAF-2071 
WIAF I WIAF-82 
WIAF I WIAF-447 
WIAF 1 WIAF-432 
WIAF I WIAF-971 
WIAF i WIAF-1832 
MARSHFIELD | MID-21 
SHGC/AFFYMETRIX 
SHGC/AFFYMETRIX 
SHGC/AFFYMETRIX 
SHGC/AFFYMETRIX 
SHGC/AFFYMETRIX 
SHGC/AFFYMETRIX 
SHGC/AFFYMETRIX 



SNP-SHGC- 
SNP'SHGO 
SNP^SHGC- 
SNP-SHGC- 
SNP-SHGC- 
SNP-SHGC- 
SNP-SHGC- 



10796 

11902 

13369 

15155 

17309 

3925 

9225 



WIAF 
WIAF 
WIAF 
WIAF 
WIAF 
WIAF 
WIAF 
WIAF 
WIAF 
WIAF 
WIAF 

WIAF 

WIAF 

WIAF 

WIAF 

WIAF 

WIAF 

WIAF 

WIAF 

WIAF 

WIAF 

UWGC 

WIAF I 



WIAF-10 

WIAF-1463 

WIAF-1581 

WIAF-1588 

WIAF-1695 

WIAF-1766 

WIAF-1767 

WIAF-1998 

WIAF-3577 

WIAF-3605 

WIAF-991 



WIAF- 

WIAF- 

WIAF- 

WIAF- 

WJAF- 

WIAF- 

WIAF- 

WIAF- 

WIAF- 

WIAF- 

I 141 

WIAF- 



3601 

2008 

1806 

3512 

1805 

1430 

3906 

759 

3564 

3787 

879 
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1 2 40.50 cR from top of Chrl 2 Iln 

1 2 52.40 cR from top of Chr1 2 lin 

1 2 53.70 cR from top of Chrl 2 lin 

1 2 53,90 cR from top of Chrl 2 lin 

5 .12 54.00 cR from top of Chr1 2 lin 

1 2 54.40 cR from top of Chrl 2 Iln 

1 2 54.40 cR from top of Chrl 2 lin 

1 2 57.40 cR from top of Chrl 2 lin 

1 2 57.40 cR from top of Chrl 2 lin 

10 12 57.40 cR from top of Chrl 2 lin 

1 2 62.80 cR from top of Chrl 2 lin 

1 2 62.90 cR from top of Chrl 2 Iln 

1 2 65.70 cR from top of Chr1 2 Iln 

1 2 66.90 cR from top of Chrl 2 lin 

15 12 67.00 cR from top of Chrl 2 lin 

12 67.20 cR from top of Chrl 2 lin 

1 2 69.60 cR from top of Chrl 2 lin 

1 2 69.90 cR from top of Chrl 2 lin 

1 2 70.60 cR from top of Chrl 2 lin 

20 12 71.10 cR from top of Chrl 2 lin 

1 2 72.30 cR from top of Chrl 2 lin 

1 2 74.50 cR from top of Chrl 2 lin 

1 2 75.40 cR from top of Chrl 2 lin 

1 2 75.80 cR from top of Chrl 2 lin 

25 1 2 75.80 cR from top of Chrl 2 lin 

12 75.80 cR from top of Chrl 2 lin 

1 2 75.80 cR from top of Chrl 2 lin 

1 2 76.50 cR from top of Chrl 2 lin 

1 2 78.60 cR from top of Chrl 2 Iln 

30 1 2 78.60 cR from top of Chrl 2 lin 

12 79.10 cR from top of Chrl 2 Iln 

12 80.10 cR from top of Chrl 2 lin 

1 2 83.4 cR from top of Chrl 2 link 

1 2 97.00 cR from top of Chrl 2 lin 

35 12 1 00.9 cR from top of Chrl 2 lin 

12 1 01 .90 cR from top of Chr1 2 li 

1 2 1 03.9 cR from top of Chrl 2 lin 

1 2 1 08.70 cR from top of Chrl 2 li 

12 11 1 .0 cR from top of Chrl 2 lin 

40 12 116.60 cR from top of. Chrl 2 li 

1 2 1 1 7.50 cR from top of Chrl 2 I 

12 1 1 9.00 cR from top of Chrl 2 I 

12 1 1 9.00 cR from top of Chrl 2 I 

12 1 1 9.00 cR from top of Chrl 2 li 

45 12 1 23,90 cR from .top of Chrl 2 I 

1 2 1 34.40 cR from top of Chrl 2 li 

1 2 1 36.30 cR from top of Chrl 2 I 

12 1 46.30 cR from top of Chrl 2 I 

1 2 1 46.30 cR from top of Chrl 2 li 

50 12 1 52.2 cR from top of Chrl 2 Iln 



1391 
1976 
3477 
3569 
1320 
3461 
3462 
3262 
3532 
3533 
2051 
1692 
1800 

913 
1632 
2046 
3597 
2561 
3089 
3525 
2495 
2050 
1853 
1313 
1314 
1315 
1730 
1799 
3234 
3235 
1705 
1409 
3335 
3377 
2507 
1610 
3034 
1397 
2689 
1972 
1991 

676 
1639 
1642 
3554 
3576 
2452 
1685 
1687 
3317 



WIAF 
WIAF 
WIAF 
WIAF 
WIAF 
WIAF 
WIAF 
WIAF 
WIAF 
WIAF 
WIAF 
WIAF 
WIAF 
WIAF 
WIAF 
WIAF 
WIAF 
WIAF 
WIAF 
WIAF 
WIAF 
WIAF 
WIAF 
WIAF 
WIAF 
WIAF 
WIAF 
WIAF 
WIAF 
WIAF 
WIAF 
WIAF 
WIAF 
WIAF 
WIAF 
WIAF 
WIAF 
WIAF 
WIAF 
WIAF 
WIAF 
WIAF 
WIAF 
WIAF 
WIAF 
WIAF 
WIAF 
WIAF 
WIAF 
WIAF 



WIAF- 
WIAF- 
WIAF- 
WIAF- 
WIAF- 
WIAF- 
WIAF- 
WIAF- 
WIAF- 
WIAF- 
WIAF- 
WIAF- 
WIAF- 
WIAF- 
WIAF- 
WIAF- 
WIAF- 
WIAF- 
WIAF- 
WIAF- 
WIAF- 
WIAF- 
WIAF- 
WIAF- 
WIAF- 
WIAF- 
WIAF« 
WIAF' 
WIAF- 
WIAF- 
WIAF- 
WIAF- 
WIAF- 
WIAF- 
WIAF- 
WIAF- 
WIAF- 
WIAF. 
WIAF- 
WIAF- 
WIAF- 
WIAF- 
WIAF- 
WIAF 
WIAF- 
WIAF- 
WIAF 
WIAF- 
WIAF. 
WIAF- 



3296 
3915 
1932 
2024 
2210 
1916 
1917 
1698 
1987 
1988 
1692 
3631 
3739 
1192 
3571 
1662 
2202 
188 
990 
1980 
93 
1691 
3792 
2203 
2204 
2205 
3669 
3738 
1643 
1644 
3644 
3330 
1788 
1830 
113 
3549 
934 
3302 
374 
3911 
3930 
1442 
•3578 
•3581 
2009 
■2031 
•40 
•3624 
3626 
1770 
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10 



15 



20 



25 



30 



35 



40 



45 



50 



12 
12 
12 
12 
12 
12 
12 
12 
12 
12 
12 
12 
12 
12 
12 
12 
12 
12 
12 
12 
12 
12 
12 
12 
12 
12 
12 
12 
12 
12 
12 
12 
12 
12 
12 
12 
12 
12 
12 
12 
12 
12 
12 
12 
12 
12 
12 
12 
12 
12 



1 52.4 cR from top of Chrl 2 lin 

152.4 cR from top of Chrl 2 lin 
1 65.90 cR from top of Chrl 2 li 
169.10 cR from top of Chrl 2 li 
169.10 cR from top of Chr12 li 

182.2 cR from top of Chrl 2 lin 

239.3 cR from top of Chrl 2 
295.9 cR from top of Chrl 2 
317.2 cR from top of Chrl 2 
31 7.2 cR from top of Chrl 2 

323.2 cR from top of Chrl 2 

327.8 cR from top of Chrl 2 

364.1 cR from top of Chrl 2 

368.3 cR from top of Chrl 2 

378.5 cR from top of Chr12 
378.5 cR from top of Chr1 2 

378.7 cR from top of Chrl 2 

390.2 cR from top of Chrl 2 lin 

396.1 cR from top of Chrl 2 

396.2 cR from top of Chrl 2 

396.2 cR from top of Chrl 2 
419.5 cR from top of Chrl 2 
419.5 cR from top of Chrl 2 

439.4 cR from top of Chrl 2 

476.9 cR from top of Chrl 2 
478.9 cR from top of Chrl 2 

483.3 cR from top of Chrl 2 

526.5 cR from top of Chr12 

557.3 cR from top of Chrl 2 

580.4 cR from top of Chrl 2 lin 

600.8 cR from top of Chr1 2 lin 
603,7 cR from top of Chrl 2 lin 

614.7 cR from top of Chrl 2 lin 

617.8 cR from top of Chrl 2 lin 
621 .0 cR from top of Chrl 2 lin 

627.9 cR from top of Chrl 2 lin 
628.7 cR from top of Chrl 2 lin 

644.5 cR from top of Chrl 2 lin 



959 WIAF I WIAF-1343 

1008 WIAF I WIAF-4037 

2957 WIAF I WIAF-856 

1562 WIAF I WIAF-3501 

1567 WIAF 1 WIAF-3506 

2506 WIAF I WIAF-1 1 1 

2385 WIAF I WIAF-2679 

3044 WIAF I WIAF-945 

2597 WIAF I WIAF-237 

2598 WIAF I WIAF-238 
2549 WIAF I WlAF-170 
2744 WIAF I WIAF-463 
3055 WIAF I WIAF'.956 
2270 WIAF I WIAF-728 

2377 WIAF I WIAF-2658 

2378 WIAF I WIAF-2661 
2919 WIAF I WIAF-818 
2927 WIAF I WIAF-826 
3140 WIAF 1 WIAF-1 041 

2883 WIAF I WIAF-781 

2884 WIAF I WIAF-782 

3151 WIAF I WIAF-1 052 

3152 WIAF I WIAF-1 053 
2583 WIAF I WIAF-216 

731 WIAF 1 WIAF-1 088 

2117 WIAF I WIAF-251 

2137 WIAF i WIAF-377 

3463 WIAF I WIAF-1 91 8 

2794 WIAF I WIAF-552 

3157 WIAF I WIAF-1058 

2737 WIAF I WIAF-449 

2681 WIAF I WIAF-364 

745 WIAF I WIAF-1 21 4 

2690 WIAF I WIAF-375 

1261 WIAF I WIAF-2148 

2814 WIAF I WIAF-607 

2964 WIAF I WIAF-864 

2180 WIAF I WIAF-562 

4118 SHGC/AFFYMETRIX 

4128 SHGC/AFFYMETRIX 

4003 SHGC/AFFYMETRIX 

4066 SHGC/AFFYMETRIX 

4070 SHGC/AFFYMETRIX 

4163 SHGC/AFFYMETRIX 

4078 SHGC/AFFYMETRIX 

4108 SHGC/AFFYMETRIX 

4036 SHGC/AFFYMETRIX 

4051 SHGC/AFFYMETRIX 

3498 WIAF I WIAF-1 953 

3772 WIAF I WIAF-2500 



SNPA-SHGC-13972 

SNPB-SHGC-13972 

SNP-SHGC-12981 

SNP-SHGC-13464 

SNP-SHGC-13943 

SNP-SHGC-14942 

SNP-SHGC-16483 

SNP-SHGC-35771 

SNP-SHGC-7632 

SNP-SHGC-9454 
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2988 WIAF I WIAF-888 



12 

1 3 1 8.60 cR from top of Chrl 3 lin 1 655 WIAF 

1 3 23.30 cR from top of Chrl 3 lin 1 855 WIAF 

5 13 24.00 cR from top of Chrl 3 lin 3527 WIAF 

1 3 24.00 cR from top of Chrl 3 lin 3528 WIAF 

1 3 27.70 cR from top of Chrl 3 lin 2044 WIAF 

13 41.20cRfromtopof Chr13lin 1251 WIAF 

i 3 42.30 cR from top of Chrl 3 lin 1 438 WIAF 

10 13 44.10 cR from top of Chrl 3 lin 3342 WIAF 

1 3 46.60 cR from top of Chrl 3 lin 31 85 WIAF 

1 3 46,60 cR from top of Chrl 3 lin 31 86 WIAF 

1 3 46.7 cR from top of Chrl 3 link 923 WIAF 

1 3 47.50 cR from top of Chrl 3 lin 2551 WIAF 

15 13 47.60 cR from top of Chrl 3 lin 3403 WIAF 

1 3 47.90 cR from top of Chrl 3 lin 1 837 WIAF 

13 58.00 cR from top of Chrl 3 lin 1451 WIAF 

13 58.50 cRfromtopof Chr13lin 1444 WIAF 

1 3 59.40 cR from top of Chrl 3 lin 1 604 WIAF 

20 13 59.80 cRfromtopof Chr13lin 1422 WIAF 

13 59.80 cRfromtopof Chr13lin 1424 WIAF 

1 3 59.80 cR from top of Chrl 3 lin 1 425 WIAF 

1 3 62,00 cR from top of Chrl 3 lin 1 897 WIAF 

1 3 66.20 cR from top of Chrl 3 lin 2542 WIAF 

25 13 69.80 cR from top of Chrl 3 lin 3196 WIAF 

1 3 72.00 cR from top of Chrl 3 lin 1 31 7 WIAF 

13 72.50 cRfromtopof Chr13lin 1453 WIAF 

13 76.1 cR from top of Chrl 3 link 3864 WIAF 

13 76.1 cR from top of Chrl 3 link 3865 WIAF 

30 13 77.10cRfromtopof Chr13lin 1365 WIAF 

1 3 78.30 cR from top of Chrl 3 lin 1 81 7 WIAF 

13 79.2 cR from top of Chrl 3 link 930 WIAF 

1 3 83.4 cR from top of Chrl 3 link 261 0 WIAF 

13 87.1 cR from top of Chrl 3 link 2047 WIAF 

35 13 87.1 cR from top of Chrl 3 link 2048 WIAF 

13 89.10 cRfromtopof Chr13lin 2930 WIAF 

13 92.80 cR from top of Chrl 3 lin 1612 WIAF 

13 1 1 7.50 cR from top of Chrl 3 li 1411 WIAF 

1 3 1 22.3 cR from top of Chrl 3 lin 31 39 WIAF 

40 13 1 25. 1 cR from top of Chrl 3 lin 781 WIAF 

13 1 25. 1 cR from top of Chrl 3 lin 782 WIAF 

13 1 25. 1 cR from top of Chrl 3 lin 783 WIAF 

1 3 1 34.3 cR from top of Chrl 3 lin 31 56 WIAF 

13 143.1 cR from top of Chr13 lin 2471 WIAF 

45 13 1 44.1 cR from top of Chrl 3 lin 2099 WIAF 

13 1 44. 1 cR from top of Chrl 3 lin 21 00 WIAF 

1 3 1 45.4 cR from top of Chrl 3 lin 301 1 WIAF 

13 1 45.4 cR from top of Chrl 3 lin 301 2 WIAF 

13 1 45.4 cR from top of Chrl 3 lin 3064 - WIAF 

50 13 1 49.7 cR from top of Chrl 3 lin 3481 WIAF 



WIAF- 
WIAF- 
WIAF- 
WIAF- 
WIAF- 
WIAF- 
WIAF- 
WIAF- 
WIAF- 
WIAF- 
WIAF- 
WIAF- 
WIAF- 
WIAF- 
WIAF- 
WIAF- 
WIAF- 
WIAF- 
WIAF- 
WIAF- 
WIAF- 
WIAF" 
WIAF- 
WIAF- 
WIAF- 
WIAF- 
WIAF- 
WIAF- 
WIAF 
WIAF 
WIAF- 
WIAF 
WIAF 
WIAF 
WIAF 
WIAF 
WIAF 
WIAF 
WIAF 
WIAF 
WIAF 
WIAF 
WIAF- 
WIAF 
WIAF 
WIAF 
WIAF 
WIAF 



3594 
3794 
1982 
1983 
1647 
2138 
3367 
1795 
1501 
1502 
1227 
■173 
1858 
3776 
3384 
3374 
-3543 
■3350 
3352 
•3353 
3836 
159 
1531 
2207 
•3386 
■2657 
■2659 
•3268 
•3756 
-1270 
■256 
-1686 
1687 
-829 
3551 
■3332 
•1040 
1455 
-1456 
-1457 
-1057 
-65 
■156 
-157 
-911 
•912 
•965 
-1936 
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10 



15 



20 



25 



30 



35 



40 



45 



50 



13 
13 
13 
13 
13 
13 
13 
13 
13 
13 
13 
13 
13 
13 
13 
13 
13 
13 
13 
13 
13 
13 
13 



14 
14 
14 
14 
14 
14 
14 
14 
14 
14 
14 
14 
14 
14 
14 
14 
14 
14 
14 
14 
14 
14 
14 
14 
14 



152.4 cR 
192.1 cR 
192.1 cR 
195.4 cR 
275.9 cR 
288.1 cR 
295.6 cR 
296.6 cR 



from top 
from top 
from top 
from top 
from top 
from top 
from top 
from top 



of Chr13 lin 
of Chrl 3 lin 
of Chrl 3 lln 
of Chr13 lin 
of Chrl 3 lin 
of Chrl 3 lin 
of Chrl 3 lin 
of Chrl 3 lin 



3.30 cR from top of Chrl 4 link 
3.30 cR from top of Chr14 link 
3.30 cR from top of Chrl 4 link 
3.30 cR from top of Chr14 link 
5.90 cR from top of Chrl 4 link 

13.5 cR from top of Chrl 4 link 
17.00 cR from top of Chrl 4 lln 

20.6 cR from top of Chrl 4 link 

20.6 cR from top of Chrl 4 link 
22.4 cR from top of Chrl 4 link 

27.7 cR from top of Chrl 4 link 
27.7 cR from top of Chrl 4 link 
32.10 cR from top of Chrl 4 lin 
36.50 cR from top of Chrl 4 lin 
36.50 cR from top of Chrl 4 lln 
37.00 cR from top of Chrl 4 lin 
37.10 cR from top of Chrl 4 lin 
42.40 cR from top of Chrl 4 lin 
46.10 cR from top of Chrl 4 lin 
53.80 cR from top of Chrl 4 lin 
54.60 cR from top of Chrl 4 lin 
56.20 cR from top of Chrl 4 lin 
59.70 cR from top of Chrl 4 lin 
62.00 cR from top of Chrl 4 lin 
63.60 cR from top of Chrl 4 lln 



3137 
2786 
2787 
789 
2197 
2159 
3842 
4562 
4229 
4143 
4146 
4083 
4096 
4008 
4102 
4022 
4034 
4039 
3389 
2982 
2491 
2492 
3072 



3304 
3305 
3306 
3307 
2785 
2796 
1948 
4567 
2271 
3845 
2696 
2906 
2915. 
1428 
1429 
1709 
701 
1260 
1689 
1834 
1379 
1701 
1961 
1838 
3281 



WIAF I WIAF-1038 
WIAF I WIAF-540 
WIAF I WlAF-541 
WIAF 1 WIAF-2062 
WIAF 1 WIAF-600 
WIAF I WIAF-497 
WIAF j WIAF-2619 
HU-CHINA I 13-401-1 
MARSHFIELD | MID-22 
SHGC/AFFYMETRIX | SNP- 
SHGC/AFFYMETRIX | SNP- 
SHGC/AFFYMETRIX | SNP- 
SHGC/AFFYMETRIX | SNP- 
SHGC/AFFYMETRIX | SNP- 
SHGC/AFFYMETRIX j SNP- 
SHGC/AFFYMETRIX j SNP- 
SHGC/AFFYMETRIX | SNP- 
SHGC/AFFYMETRIX I SNP- 



WIAF 
WIAF 
WIAF 
WIAF 
WIAF 



WIAF 
WIAF 
WIAF 
WIAF 
WIAF 
WIAF 
WIAF 
HU-( 
WIAF 
WIAF 
WIAF 
WIAF 
WIAF 
WIAF 
WIAF 
WIAF 
WIAF 
WIAF 
WIAF 
WIAF 
WIAF 
WIAF 
WIAF 
WIAF 
WIAF 



CHINA 



WIAF-1843 

WIAF-882 

WIAF-89 

WIAF-90 

WIAF-973 



WIAF-1757 
WIAF-1758 
WIAF-1759 
WIAF-1760 
WIAF-536 
WIAF-554 
WIAF-3887 
14-729 
WIAF-729 
WIAF-2624 
WIAF-382 
WlAF-804 
WIAF-814 
WIAF-3356 
WIAF-3357 
WIAF-3648 
WIAF-1296 
WIAF-2147 
WIAF-3628 
WIAF-3773 
WIAF-3283 
WIAF-3640 
WIAF-3900 
WIAF-3777 
WIAF-1734 



SHGC- 
SHGC- 
SHGO 
SHGC- 
SHGC- 
SHGC- 
SHGC- 
SHGC- 
SHGC- 



13649 

13999 

16887 

18881 

2426 

30142 

4718 

6784 

8465 
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14 66.50 cR from top of ehr14 lin 

14 66.50 cR from top of'Chr14 lin 

14 66.80 cR from top of Chr14 lin 

14 66.80 cR from top of ChrU lin 

5 14 66.80 cR from top of Chrl 4 lin 

14 69.0 cR from top of Chr14 link 

14 71 .50 cR from top of ChrU lin 

14 75 cM 

1 4 86.30 cR from top of Chrl 4 lin 

10 14 95.5 cR from top of ChrU link 

14 99.60 cR from top of ChrU lin 

14 1 01 .00 cR from top of Chrl 4 I 

14 1 01 .00 cR from top of Chrl 4 I 

14 1 0 1 .00 cR from top of Chrl 4 I 

15 14 1 09.00 cR from top of Chrl 4 1 

1 4 1 09.00 cR from top of Chrl 4 I 

14 1 21 .60 cR from top of Chrl 4 1 

1 4 1 24.20 cR from top of Chrl 4 I 

U 124,20 cR from top of ChrU 1 

20 14 124.20 cR from top of ChrU 1 

1 4 1 24.20 cR from top of Chrl 4 1 

14 124.20 cR from top of ChrU I 

14 1 25.8 cR from top of ChrU lin 

14 1 26.2 cR from top of Chrl 4 I 

25 14 141 .7 cR from top of ChrU I 

14 1 67.2 cR from top of ChrU I 

14 168.5 cR from top of ChrU I 

14 168.5 cR from top of ChrU I 

14 1 74. 5 cR from top of Chrl 4 I 

30 14 1 79. 1 cR from top of Chrl 4 I 

14 1 97.4 cR from top of Chrl 4 I 

14 1 97.6 cR from top of ChrU I 

14 228.7 cR from top of ChrU I 

14 248.8 cR from top of ChrU I 

35 14 252.9 cR from top of ChrU I 

U 252.9 cR from top of ChrU I 

14 253.1 cR from top of ChrU I 

14 253.4 cR from top of ChrU I 

14 255,0 cR from top of ChrU I 

40 14 255.1 cR from top of ChrU li 

14 255.3 cR from top of ChrU I 

14 263.3 cR from top of ChrU I 

U 278.2 cR from top of ChrU I 

14 298 . 2 cR from top of Chrl 4 I 

45 14 308.8 cR from top of ChrU I 

1 4 324.3 cR from top of Chrl 4 lin 

1 4 324.3 cR from top of Chrl 4 lin 

14 335.6 cR from top of ChrU lin 

14 335.6 cR from top of ChrU lin 

50 U 335.6 cR from top of ChrU lin 



3206 
3207 
1402 
1406 
1408 
1080 
1561 
4309 
3390 
2900 
1669 
1137 
1138 
1139 
1539 
1541 
1398 
1432 
1631 
1803 
1805 
1908 
3545 

744 
2249 

751 

863 
1011 
3301 
3542 
2640 
3465 
2859 
2709 
3135 
3136 
2924 
3876 
3361 
3325 
2602 
2599 
3132 
2240 

961 
2936 
2937 
4582 
4555 
3583 



WIAF 

WIAF 

WIAF 

WIAF 

WIAF 

WIAF 

WIAF 

UWGC 

WIAF 

WIAF" 

WIAF 

WIAF 

WIAF 

WIAF 

WIAF 

WIAF 

WIAF 

WIAF 

WIAF 

WIAF 

WIAF 

WIAF 

WIAF 

WIAF 

WIAF 

WIAF 

WIAF 

WIAF 

WIAF 

WIAF 

WIAF 

WIAF 

WIAF 

WIAF 

WIAF 

WIAF 

WIAF 

WIAF 

WIAF 

WIAF 

WIAF 

WIAF 

WIAF 

WIAF 

WIAF 

WIAF 

WIAF 



WIAF 
WIAF 
WIAF 
WIAF 
WIAF-: 
WIAF- 
WIAF-: 
I 128 
WIAF 
WIAF 
WIAF 
WIAF 
WIAF 
WIAF 
WIAF 
WIAF 
WIAF- 
WIAF 
WIAF-: 
WIAF 
WIAF- 
WIAF 
WIAF 
WIAF 
WIAF 
WIAF- 
WIAF 
WIAF 
WIAF- 
WIAF- 
WIAF- 
WIAF- 
WIAF- 
WIAF- 
WIAF 
WIAF- 
WIAF 
WIAF- 
WIAF 
WIAF- 
WIAF 
WIAF- 
WIAF- 
WIAF 
WIAF 
WIAF 
WIAF 



HU-CHINA I 1 
HU-CHINA I 1 
WIAF I WIAF 



1557 
1558 
3322 
3326 
3329 
2051 
3500 

1845 
798 
3608 
1600 
1601 
1602 
3478 
3480 
3303 
3360 
3570 
3742 
3744 
3847 
2000 
1211 
694 
1263 
1073 
4046 
1754 
1997 
307 
1920 
754 
402 
1036 
1037 
823 
2677 
1814 
1778 
246 
239 
1033 
680 
1352 
835 
836 
4-2041 
4-2041-2 
2041 
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4 cR from top of Chr14 lin 

5 cR from top of Chr14 lin 

6 cR from top of Chrl 4 lin 
0 cR from top of Chr14 lin 

7 cR from top of Chrl 4 lin 



14 352. 
14 355. 
14 355, 
14 359, 
5 14 363, 
14 
.14 
14 
14 . 
10 14 
14 
14 
14 
14 

15 14 
14 
14 
14 

14 

20 14 
14 
14 



15 0.00 cR from top of Chrl 5 link 

25 1 5 4.70 cR from top of Chrl 5 link 

1 5 5.40 cR from top of Chrl 5 link 

1 5 9.60 cR from top of Chrl 5 link 

15 1 1 .00 cR from top of Chrl 5 lin 

15 13.7 cR from top of Chrl 5 link 

30 15 1 3.7 cR from top of Chrl 5 link 

1 5 1 7.70 cR from top of Chrl 5 lin 

1 5 21 .70 cR from top of Chrl 5 lin 

1 5 22.90 cR from top of Chrl 5 lin 

1 5 25.50 cR from top of Chrl 5 lin 

35 1 5 28.9 cR from top of Chrl 5 link 

1 5 28.9 cR from top of Chrl 5 link 

1 5 37.30 cR from top of Chrl 5 lin 

1 5 38.20 cR from top of Chrl 5 lin 

1 5 42.0 cR from top of Chrl 5 link 

40 1 5 42.70 cR from top of Chrl 5 lin 

15 46 cM 

1 5 46.20 cR from top of Chrl 5 lin 

1 5 46.30 cR from top of Chrl 5 lin 

1 5 46.30 cR from top of Chrl 5 lin 

45 1 5 46.30 cR from top. of Chrl 5 lin 

1 5 46.40 cR from top of Chrl 5 lin 

1 5 46.8 cR from top of Chr1 5 link 

1 5 46.90 cR from top of Chrl 5 lin 

15 47 cM 

50 15 48.10 cR from top of Chrl 5 lin 



3362 
2653 
3440 
3294 
2647 
3997 
4137 
4157 
4088 
4101 
4168 
4032 
4045 
3125 
666 
3595 
3596 
1328 
1589 
2728 
2729 
2974 

3577 
1924 
1712 
2880 

696 
2931 
2932 

.710 
3547 
3153 
2904 
2058 
3032 
1968 
2473 
2625 
2469 
4304 
3231 
3513 
1670 
2426 
1944 
2956 
1382 
4305 
1951 



WIAF I WIAF-1815 

WIAF I WIAF-327 

WIAF I WIAF-1895 

WIAF I WIAF-1747 

WIAF I WIAF-318 

SHGC/AFFYMETRIX 

SHGC/AFFYMETRIX 

SHGC/AFFYMETRIX 

SHGC/AFFYMETRIX 

SHGC/AFFYMETRIX 

SHGC/AFFYMETRIX 

SHGC/AFFYMETRIX 

SHGC/AFFYMETRIX 



WIAF 
WIAF 
WIAF 
WIAF 
WIAF 
WIAF 
WIAF 
WIAF 
WIAF 

WIAF 

WIAF 

WIAF 

WIAF 

WIAF 

WIAF 

WIAF 

WIAF 

WIAF 

WIAF 

WIAF 

WIAF 

WIAF 

WIAF 

WIAF 

WIAF 

WIAF 

UWGC 

WIAF 

WIAF 

WIAF 

WIAF 

WIAF 

WIAF 

WIAF 

UWGC 

WIAF I 



WIAF-1026 

WIAF-1072 

WIAF-2187 

WIAF-2188 

WIAF-2218 

WIAF-3528 

WIAF-434 

WIAF-435 

WIAF-874 



WIAF 
WIAF 
WIAF 
WIAF 
WIAF 
WIAF 
WIAF 
WIAF 
WIAF 
WIAF 
WIAF 
WIAF 
WIAF 
WIAF 
WIAF 
WIAF 
WIAF 
I 123 
WIAF 
WIAF 
WIAF 
WIAF 
WIAF 
WIAF 
WIAF 
I 124 
WIAF 



2032 
-3863 
■3651 
778 
1254 
•830 
831 
•1439 
■2002 
■1054 
802 
2543 
932 
3907 
'68 
284 
62 

•1633 
■1968 
3609 
6 

-3883 
855 
3287 

-3890 



SNP-SHGC- 
SNi=>-SHGC- 
SNP-SHGC- 
SNP-SHGC- 
SNP-SHGC- 
SNP-SHGC- 
SNP-SHGC- 
SNP-SHGC- 



1127 

13065 

14530 

17097 

19244 

23875 

6098 

9043 
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15 • 48.20 cR from top of Chr15 lin 3232 WIAF 

1 5 49.70 cR from top of Chrl 5 lin 695 WIAF 

15 49.9 cR from top of Chrl 5 link 2893 WIAF 

15 53 cM 4318 UWGC 

5 1 5 53.70 cR from top of Chrl 5 lin 31 76 WIAF 

15 60.90cRfromtopof Chr15 1in 1695 WIAF 

1 5 65.30 cR from top of Chrl 5 lin 1 434 WIAF 

15 65.30cRfromtopof Chris lin 1436 WIAF 

1 5 65.8 cR from top of Chrl 5 link 3093 WIAF 

10 15 65.8 cR from top of Chrl 5 link 3094 WIAF 

1 5 70.7 cR from top of Chrl 5 link 3573 WIAF 

1 5 71 .30 cR from top of Chrl 5 lin 3489 WIAF 

15 71 .30 cR from top of Chrl 5 lin 3490 WIAF 

1 5 71 .80 cR from top of Chrl 5 lin 3557 WIAF 

15 15 72.20 cR from top of Chrl 5 lin 1578 WIAF 

1 5 72.70 cR from top of Chrl 5 lin 1 680 WIAF 

1 5 73.70 cR from top of Chrl 5 lin 31 75 WIAF 

1 5 74.90 cR from top of Chrl 5 lin 1 974 WIAF 

1 5 75.50 cR from top of Chrl 5 lin 3452 WIAF 

20 1 5 75.70 cR from top of Chrl 5 lin 31 60 WIAF 

1 5 76.40 cR from top of Chrl 5 lin 1 937 WIAF 

15 76.6 cR from top of Chr1 5 link 3719 WIAF 

15 77.30 cR from top of Chrl 5 lin 1835 WIAF 

1 5 77.40 cR from top of Chrl 5 lin 201 5 WIAF 

25 15 78.60 cR from top of Chrl 5 lin 1 744 WIAF 

15 85.4cRfromtopof Chrl 5 link 2101 WIAF 

1 5 94.40 cR from top of Chrl 5 lin 11 90 WIAF 

15 97,60 cR from top of Chrl 5 lin 1 760 WIAF 

1 5 1 02.60 cR from top of Chrl 5 ii 1 648 WIAF 

30 15 1 04.40 oR from top of Chrl 5 Ii 1910 WIAF 

15 105.4 cR from top of Chrl 5 lin 2646 WIAF 

15 105.5 cR from top of Chrl 5 lin 2711 WIAF 

1 5 1 08.70 cR from top of Chrl 5 Ii 1911 WIAF 

1 5 106.70 cR from top of Chrl 5 Ii 1914 WIAF 

35 15 121.4 cR from top of Chrl 5 lin 3496 WIAF 

15 1 21 .7 cR from top of Chrl 5 lin 3406 WIAF 

1 5 1 33.5 cR from top of Chrl 5 lin 381 2 WIAF 

1 5 1 39.7 cR from top of Chrl 5 lin 21 74 WIAF 

15 142.1 cR from top of Chrl 5 lin 2097 WIAF 

40 15 1 44.2 cR from top of Chrl 5 lin 2660 WIAF 

1 5 1 52.5 cR from top of Chrl 5 lin 2490 WIAF 

1 5 1 59.6 cR from top of Chrl 5 lin 2345 WIAF 

1 5 1 80.8 cR from top of Chrl 5 lin 2679 WIAF 

1 5 1 94.2 cR from top of Chr1 5 lin 2585 WIAF 

45 15 195.6 cR from top of Chrl 5 lin 3099 WIAF 

1 5 202.7 cR from top of Chrl 5 lin 2721 WIAF 

1 5 207.0 cR from top of Chr1 5 lin 2539 WIAF 

1 5 228.4 cR from top of Chrl 5 lin 31 1 0 WIAF 

1 5 228.4 cR from top of Chrl 5 lin 31 1 1 WIAF 

50 1 5 228.6 cR from top of Chrl 5 lin 2949 WIAF 



WIAF- 

WIAF- 

WIAF- 

I 137 

WIAF- 

WIAF- 

WIAF. 

WIAF- 

WIAF 

WIAF- 

WIAF- 

WIAF- 

WIAF- 

WIAFr- 

WIAF- 

WIAF- 

WIAF 

WIAF 

WIAF 

WIAF 

WIAF 

WIAF 

WIAF 

WIAF 

WIAF 

WIAF 

WIAF 

WIAF 

WIAF 

WIAF 

WIAF 

WIAF 

WIAF 

WIAF 

WIAF 

WIAF 

WIAF 

WIAF 

WIAF 

WIAF 

WIAF 

WIAF 

WIAF 

WIAF 

WIAF 

WIAF 

WIAF 

WIAF 

WIAF 

WIAF 



1635 
1248 
791 

1491 
3634 
3362 
3364 
994 
995 
2028 
1944 
1945 
2012 
3517 
3619 
1490 
3913 
1907 
1459 
3876 
2447 
3774 
1509 
3683 
163 
2077 
3699 
3587 
3849 
317 
406 
3850 
3853 
1951 
1861 
2564 
547 
140 
336 
88 

'2576 
•362 
•219 
•1000 
■425 
■153 
■1011 
•1012 
-848 



-147- 



wo 01/62968 



PCT/USOl/00452 



CHROMOSOME 



FINE MAP 
LOCATION 



dbSNP HANDLE | LOCAL 
ASSAY ID SNPID 



10 



15 



20 



25 



30 



35 



40 



45 



15 
15 
15 
15 
15 
15 
15 
15 
15 
15 
15 
15 
15 
15 
15 
15 
15 
15 
15 
15 
15 
15 
15 
15 
15 
15 
15 
15 
15 
15 



16 
16 
16 
16 
16 
16 
16 
16 
16 
16 
16 
16 
16 
16 
16 
16 
16 



247.4 
286.2 
306.5 
306.5 
308.4 
332.8 
332.8 
344.1 
355.1 
355.1 
363.7 
388.6 
396.8 
396.8 



cR from top 
cR from top 
cR from top 
cR from top 
cR from top 
cR from top 
cR from top 
cR from top 
cR from top 
cR from top 
cR from top 
cR from top 
cR from top 
cR from top 



of Chr15!i 
of Chr1 5 I 
of Chr15 
of Chr15 I 
of Chr15 I 
of Chrl 5 I 
of Chrl 5 li 
of Chr15 li 
of Chrl 5 I 
of Chr15lin 
of Chrl 5 lin 
of Chrl 5 lin 
of Chris lin 
of Chrl 5 lin 



50 16 



5.10 cR from top of Chrl 6 link 
5.60 cR from top of Chrl 6 link 
5.80 cR from top of Chrl 6 link 
9,00 cR from top of Chrl 6 link 
10.40 cR from top of Chrl 6 lin 
14.60 cR from top of Chrl 6 lin 
15.50 cR from top of Chrl 6 lin 
20.60 cR from top of Chrl 6 lln 

21.4 cR from top of Chr16 link 

22.1 cR from top of Chrl 6 link 

25.2 cR from top of Chrl 6 link 

25.5 cR from top of Chrl 6 link 
30.60 cR from top of Chrl 6 lin 

37.6 cR from top of Chrl 6 link 
41 .60 cR from top of Chrl 6 lin 
42.70 cR from top of Chrl 6 lin 
42,70 cR from top of Chrl 6 lin 
43.30 cR from top of Chrl 6 lin 



1053 
2369 
856 
2740 
3969 
2115 
2116 
2584 
2733 
2734 
3457 
2557 
1054 
2980 
4120 
4130 
4139 
4148 
4154 
3971 
4047 
3386 
3684 
3761 
1774 
1928 
3906 
3920 
3025 
3026 



1151 
3191 
1773 
693 
3024 
1404 
1889 
1223 
2710 
2766 
2481 
2891 
3550 
2760 
2036 
1485 
1489 
1649 



WIAF I WIAF-4162 
WIAF I WIAF-2630 
WIAF I WIAF-453 
WIAF I WIAF-454 
SHGC/AFFYMETRIX | 
WIAF I WIAF-243 
WIAF I WIAF-244 
WIAF I WIAF-218 
WIAF I WIAF-443 
WIAF I WIAF-444 
WIAF I WIAF-1912 
WIAF I WIAF-181 
WIAF I WIAF-4163 
WIAF I WIAF'880 
SHGC/AFFYMETRIX | 
SHGC/AFFYMETRIX | 
SHGC/AFFYMETRIX | 
SHGC/AFFYMETRIX j 
SHGC/AFFYMETRIX | 
SHGC/AFFYMETRIX | 
SHGC/AFFYMETRIX | 



SNP-SHGC-14665 



SNPA-SHGC-15063 

SNPB-SHGC-1 5063 

SNP^SHGC-13105 

SNP-SHGC-14096 

SNP-SHGC-14356 

SNP-SHGC'17150 

SNP-SHGC-9310 



WIAF 
WIAF 
WIAF 
WIAF 
WIAF 
WIAF 
WIAF 
WIAF 
WIAF 



WIAF 
WIAF 
WIAF 
WIAF 
WIAF 
WIAF 
WIAF 
WIAF 
WIAF 
WIAF 
WIAF 
WIAF 
WIAF 
WIAF 
WIAF 
WIAF 
WIAF 
WIAF 



Wl AF-1 840 

WIAF-2412 

WIAF-2489 

WIAF-3713 

WIAF-3867 

WIAF-3980 

WIAF-4007 

WIAF-925 

WlAF-926 



WIAF-1628 

WIAF-1514 

WIAF-3712 

WIAF-1 244 

WIAF-924 

WIAF'-3324 

WIAF-382S 

WIAF-2110 

WIAF-403 

WIAF-506 

WIAF-77 

WIAF-789 

WIAF-2005 

WIAF-495 

WIAF-1 61 3 

WIAF-3424 

WIAF-3428 

WIAF-3588 
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1 6 44.0 cR from top of Chrl 6 link 

1 6 69.70 cR from top of Chrl 6 lin 

1 6 69.70 cR from top of Chrl 6 lin 

1 6 70.3 cR from top of Chrl 6 link 

5 1 6 72.60 cR from top of Chrl 6 lin 

1 6 74,20 cR from top of Chrl 6 lin 

1 6 75.80 cR from top of Chrl 6 lin 

1 6 86.50 cR from top of Chrl 6 lin 

1 6 87.50 cR from top of Chrl 6 lin 

10 16 91.50 cR from top of Chrl 6 lin 

16 91.50 cR from top of Chrl 6 lin 

1 6 92.40 cR from top of Chr1 6 lin 

1 6 97.90 cR from top of Chrl 6 lin 

1 6 98.0 cR from top of Chrl 6 link 

-15 16 98.10 cR from top of Chrl 6 lin 

1 6 98.20 cR from top of Chrl 6 lin 

16 103.10 cR from top of Chr16 I 

16 1 03. 1 0 cR from top of Chrl 6 li 

1 6 1 07.60 cR from top of Chrl 6 li 

20 16 1 09.20 cR from top of Chrl 6 I 

16 1 09.20 cR from top of Chrl 6 I 

1 6 1 09.20 cR from top of Chrl 6 I 

1 6 1 09.40 cR from top of Chr1 6 I 

16 1 1 2.3 cR from top of Chrl 6 lia 

25 16 1 13.8 cR from top of Chrl 6 lin 

16 113.9 cR from top of Chrl 6 lin 

16 11 3.9 cR from top of Chrl 6 lin 

16 1 19,10 cR from top of Chrl 6 li 

16 1 19.10 cR from top of Chrl 6 li 

30 16 122.1 cR from top of Chrl 6 lin 

1 6 1 23.30 cR from top of Chrl 6 li 

1 6 1 23.30 cR from top of Chrl 6 II 

1 6 1 30.80 cR from top of Chrl 6 li 

1 6 1 30.80 cR from top of Chrl 6 li 

35 16 131.4 cR from top of Chr1 6 lin 

16 1 40. 1 cR from top of Chrl 6 lin 

16 227.3 cR from top of Chrl 6 lin 

16 235.6 cR from top of Chrl 6 lin 

1 6 242.9 cR from top of Chrl 6 lin 

40 1 6 242.9 cR from top of Chrl 6 lin 

16 305.1 cR from top of Chrl 6 lin 

16 312.9 cR from top of Chrl 6 lin 

1 6 320.4 cR from top of Chrl 6 lin 

1 6 327.3 cR from top of Chrl 6 lin 

45 16 330.5 cR from top of Chrl 6 lin 

1 6 333.4 cR from top of Chrl 6 lin 

1 6 333.6 cR from top of Chrl 6 lin 

1 6 338.2 cR from top of Chrl 6 lin 

1 6 348.6 cR from top of Chr1 6 lin 

50 16 351.6 cR from top of Chrl 6 lin 



3021 
4566 
3261 
2770 
1249 

690 
1965 
1775 
1704 
1495 
1496 
3359 
3166 
2759 
3454 
1671 
1555 
1654 
3121 
1435 
1437 
1439 
2251 
2375 
3057 
3046 
3047 
1912 
1916 
2791 
2054 
2055 
1255 
1776 
2989 
2122 
3409 
3516 
2241 
2242 
1270 
3058 
2468 
3556 
3860 
2515 
2560 
2519 

892 
2146 



WIAF I WIAF- 
HU-CHINA 1 1 



WIAF 
WIAF 
WIAF 
WIAF 
WIAF 
WIAF 
WIAF 
WIAF 
WIAF 
WIAF 
WIAF 
WIAF 
WIAF 
WIAF 
WIAF 
WIAF 
WIAF 
WIAF 
WIAF 
WIAF 
WIAF 
WIAF 
WIAF 
WIAF 
WIAF 
WIAF 
WIAF 
WIAF 
WIAF 
WIAF 
WIAF 
WIAF 
WIAF 
WIAF 
WIAF 
WIAF 
WIAF 
WIAF 
WIAF 
WIAF 
WIAF 
WIAF 
WIAF 
WIAF 
WIAF 
WIAF 
WIAF 
WIAF 



WIAF 
WIAF 
WIAF 
WIAF- 
WIAF- 
WIAF- 
WIAF 
WIAF 
WIAF" 
WIAF- 
WIAF 
WIAF 
WIAF- 
WIAF 
WIAF-: 
WIAF 
WIAF 
WIAF 
WIAF-: 
WIAF-: 
WIAF 
WIAF 
WIAF 
WIAF-! 
WIAF-l 
WIAF- 
WIAF- 
WIAF 
WIAF 
WIAF 
WIAF 
WIAF 
WIAF 
WIAF 
WIAF 
WIAF 
WIAF 
WIAF- 
WIAF 
WIAF 
WIAF 
WIAF- 
WIAF 
WIAF- 
WIAF 
WIAF- 
WIAF 
WIAF 



921 

6-1697 
1697 
510 
2136 
1210 
3904 
3714 
3643 
3434 
3435 
1812 
1474 
494 
1909 
3610 
3494 
3593 
1022 
3363 
3366 
3368 
697 
2652 
958 
947 
948 
3851 
3855 
546 
1717 
1718 
2142 
3715 
889 
285 
1864 
1971 
681 
682 
2157 
959 
61 

2011 
2650 
123 
184 
127 
1139 
437 
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10 



15 



16 351.6 cR from top of Chr16 lin 2147 
16 351.6 cR from top of Chr16 lin 2182 

16 4230 

16 3966 

16 4038 

16 1146 

16 3671 

16 2066 

16 3810 

16 1482 

16 1486 

16 1527 

16 1565 

16 2960 

16 2992 



WIAF I WIAF-438 
WIAF I WIAF-564 
MARSHFIELD | MID-23 
SHGC/AFFYMETRIX 1 SNP-SHGC-1 201 1 
SHGC/AFFYMETRIX | SNP-SHGC-81 52 



WIAF 
WIAF 
WIAF 
WIAF 
WIAF 
WIAF 
WIAF 
WIAF 
WIAF 
WIAF 



WIAF-1614 

WIAF-2399 

WIAF-2553 

WIAF-2562 

WIAF-3421 

WIAF-3425 

WIAF-3466 

WIAF-3504 

WIAF-859 

WIAF-892 



17 O.eOcRfromtopof Chr17link 2435 WIAF | 

1 7 0.60 cR from top of Chrl 7 link 3467 WIAF j 

20 17 0,60 cR from top of Chrl 7 link 1399 WIAF | 

17 1.40 cR from top of Chrl 7 link 1726 WIAF | 

17 2.10cRfromtopof Chrl 7 link 1108 WIAF 

17 4.50 cR from top of Chrl 7 link 31 1 5 WIAF 

17 5.90 cR from top of Chrl 7 link 3549 WIAF 

25 17 7.60 cR from top of Chrl 7 link 1741 WIAF 

17 7.60 cR from top of Chrl 7 link 2451 WIAF 

17 15.00 cR from top of Chrl 7 lin 1240 WIAF 

17 16.30 cR from top of Chrl 7 lin 1586 WIAF 

17 16,50 cR from top of Chrl 7 lin 1175 WIAF 

30 17 1 6.80 cR from top of Chrl 7 lin 2035 WIAF 

1 7 1 6.80 cR from top of Chrl 7 lin 1 721 WIAF 

17 1 6.80 cR from top of Chrl 7 lin 2460 WIAF 

17 19 cM 4313 UWGC 

17 29.30 cR from top of Chrl 7 lin 1812 WIAF 

35 1 7 33.5 cR from top of Chrl 7 link 2922 WIAF 

1 7 36.40 cR irom top of Chrl 7 lin 2553 WIAF 

1 7 45.20 cR from top of Chrl 7 lin 1 473 WIAF 

17 45.40 cR from top of Chrl 7 lin 1950 WIAF 

1 7 45.5 cR from top of Chrl 7 link 301 8 WIAF 

40 17 45.5 cR from top of Chrl 7 link 3041 WIAF 

17 51.8 cR from top of Chrl 7 link 3062 WIAF 

1 7 51 .90 cR from top of Chrl 7 lin 709 WIAF 

17 53.10cRfromtopof Chr17lin 2018 WIAF 

1 7 57.30 cR from top of Chrl 7 lin 3541 WIAF 

45 17 59.9 cRfromtopof Chrl 7 link 2846 WIAF 

17 60.10 cR from top of Chrl 7 lin 1258 WIAF 

1 7 60.60 cR from top of Chrl 7 lin 1 441 WIAF 

17 ei.lOcRfromtopof Chr17lin 1696 WIAF 

17 62.8 cR from top of Chrl 7 link 2564 WIAF 

50 1 7 62.80 cR from top of Chrl 7 lin 3387 WIAF 



WIAF- 

WIAF- 

WIAF- 

WIAF- 

WIAF- 

WIAF- 

WIAF- 

WIAF- 

WIAF- 

WIAF- 

WIAF- 

WIAF- 

WIAF- 

WIAF- 

WIAF- 

I 132 

WIAF- 

WIAF- 

W1AF- 

WIAF- 

WIAF- 

WIAF. 

WIAF- 

WIAF- 

WIAF- 

WIAF- 

WIAF- 

WIAF- 

WIAF- 

WIAF- 

WIAF- 

WIAF- 

WIAF- 



18 

1922 
3305 
3665 
1540 
1016 
2004 
3680 
39 

2127 
3525 
1699 
1598 
3660 
51 

3751 

821 

176 

3410 

3889 

918 

942 

963 

1419 

1519 

1996 

699 

2145 

3370 

3635 

193 

1841 
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10 



15 



20 



25 



30 



35 



40 



45 



50 



17 
17 
17 
17 
17 
17 
17 
17 
17 
17 
17 
17 
17 
17 
17 
17 
17 
17 
17 
17 
17 
17 
17 
17 
17 
17 
17 
17 
17 
17 
17 
17 
17 
17 
17 
17 
17 
17 
17 
17 
17 
17 
17 
17 
17 
17 
17 
17 
17 
17 



63.10 cR from top of Chr17 lin 
63.40 cR from top of Chr17 lin 
64.80 cR from top of Chrl 7 lin 
65.00 cR from top of Chrl 7 lin 
65.00 cR from top of Chrl 7 lin 
66.10 cR from top of Chrl 7 !in 
67.00 cR from top of Chrl 7 lin 
68.10 cR from top of Chrl 7 lin 

73.00 cR from top of Chrl 7 lin 
83.90 cR from top of Chrl 7 lin 
84.1 0 cR from top of Chrl 7 lin 
84.90 cR from top of Chr1 7 lin 
84.90 cR from top of Chrl 7 lin 
86.30 cR from top of Chrl 7 lin 
86.70 cR from top of Chrl 7 lin 
87.60 cR from top of Chrl 7 lin 

94.1 cR from top of Chr17 link 
94.1 cR from top of Chrl 7 link 
94.1 cR from top of Chrl 7 link 

103.5 cR from top of Chrl 7 lin 

250.6 cR from top of Chrl 7 lin 

304.7 cR from top of Chrl 7 lin 
307.9 cR from top of Chrl 7 lin 
311.1 cR from top of Chrl 7 lin 
317.4 cR from top of Chrl 7 lin 

329.4 cR from top of Chrl 7 lin 
338.1 cR from top of Chrl 7 lin 
338.6 cR from top of Chrl 7 lin 
355.3 cR from top of Chrl 7 lin 

355.5 cR from top of Chrl 7 lin 
371 .5 cR from top of Chrl 7 lin 
445.5 cR from top of Chrl 7 lin 
462.1 cR from top of Chrl 7 lin 



3183 
684 
1997 
1840 
1841 
3453 
1637 
3242 
3350 
3397 
1094 
700 
673 
671 
1416 
1898 
2278 
2279 
2280 
2087 
3008 
2975 
3288 
2844 
2858 
2861 
2869 
2567 
2767 
3000 
3821 
2842 
3570 
4053 
4059 
4001 
4006 
4072 
4092 
4165 
4095 
4015 
3120 
3127 
2550 
1335 
3678 
3777 
3778 
3800 



WIAF I WiAF-1499 
WIAF I WIAF-1108 
WIAF I WIAF-3936 
WIAF 1 WIAF-3779 
WIAF I WIAF-3780 
WIAF I WIAF-1908 
WIAF I WIAF-3576 
WIAF I WIAF-1652 
WIAF I WIAF-1803 
WIAF I WIAF-1852 
WIAF ] WIAF-1479 
WIAF I WIAF-1276 
WIAF I WIAF-1376 
WIAF j WIAF-1361 
WIAF I WIAF-3343 
WIAF 1 WIAF-3837 
WIAF 1 WIAF-746 
WIAF 1 WIAF-747 
WIAF I WIAF-748 
WIAF I WIAF-101 
WIAF I WlAF-908 
WIAF 1 WIAF-875 
WIAF I WIAF-1741 
WIAF I WIAF-688 
WIAF I WIAF-752 
WIAF I WIAF-758 
WIAF I WIAF-767 
WIAF I WIAF-196 
WIAF I WIAF.507 
WIAF I WIAF-900 
WIAF I WIAF-2582 
WIAF I WIAF-684 
WIAF I WIAF-2025 
SHGC/AFFYMETRIX 
SHGC/AFFYMETRIX 
SHGC/AFFYMETRIX 
SHGC/AFFYMETRIX 
SHGC/AFFYMETRIX 
SHGC/AFFYMETRIX 
SHGC/AFFYMETRIX 
SHGC/AFFYMETRIX 
SHGC/AFFYMETRIX 



SNPA-SHGC-31580 

SNPB-SHGC-31580 

SNP-SHGC-1216 

SNP-SHGC-1310 

SNP-SHGC-14793 

SNP-SHGC-17275 

SNP-SHGC-18143 

SNP-SHGC-18839 

SNP-SHGC-3939 



WIAF 
WIAF 
WIAF 
WIAF 
WIAF 
WIAF 
WIAF 
WIAF 



WIAF-1021 

WIAF-1028 

WIAF-171 

WIAF-2225 

WIAF-2406 

WIAF-2505 

WIAF-2506 

WIAF-2529 
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17 
17 



2463 
3073 



WIAF I WIAF-55 
WIAF I WIAF-974 



10 



15 



20 



25 



30 



35 



40 



45 



1 8 7.40 cR from top of Chrl 8 link 201 1 WIAF | WlAF-1 505 

18 7.40 cR from top of Chrl 8 link 2012 WIAF | WIAF-1506 

18 7.90 cR from top of Chrl 8 link 1 189 WIAF | WIAF-2076 

18 19.5 cR from top of Chrl 8 link 3834 WIAF | WIAF-2603 

18 20.90 cR from top of Chrl 8 Hn 3226 WIAF j WIAF-1626 

18 21 .1 cR from top of Chrl 8 link 2820 WIAF | WIAF-621 

18 28.1 cR from top of Chrl 8 link 3848 WIAF j WIAF-2631 

18 32.1 cR from top of Chrl 8 link 2819 WIAF j WIAF-620 

18 35.0 cR from top of Chris link 4584 HU-CHINA | 18-525 

18 35.0 cR from top of Chrl 8 link 4557 HU-CHINA | 18-525-2 

18 35.0 cR from top of Chrl 8 link 2163 WIAF | WIAF-525 

18 35.0 cR from top of Chrl 8 link 2164 WIAF | WIAF-526 

18 36.20 cR from top of Chr18 lin 3355 WIAF 1 WIAF-1808 

18 36.20 cR from top of Chrl 8 lin 3356 WIAF j WIAF-1809 

18 43.10 cR from top of Chrl 8 lin 1250 WIAF | WIAFt2137 

18 45.4 cR from top of Chrl 8 link 2587 WIAF | WIAF-222 

IB 45.6 cR from top of Chrl 8 link 3101 WIAF | WIAF-1002 

18 52.5 cR from top of Chrl 8 link 3027 WIAF | WIAF-927 

18 56.2 cR from top of Chrl 8 link 3278 WIAF | WIAF-1731 

18 56.2 cR from top of Chr18 link 3279 WIAF | WIAF-1732 

18 56.2 cR from top of Chrl 8 link 3280 WIAF j WIAF-1733 

18 57.00 cR from top of Chrl 8 lin 1940 WIAF | WIAF-3879 

18 58.1 cR from top of Chr18 link 3100 WIAF | WIAF-1001 

18 61.50 cR from top of Chrl 8 lin 1127 WIAF | WIAF-1571 

18 61.50 cR from top of Chrl 8 lin 1128 WIAF | WIAF-1572 

18 61.60 cR from top of Chrl 8 lin 3164 WIAF | WlAF-1468 

18 66.60 cR from top of Chrl 8 lin 2486 WIAF | WIAF-84 

1 8 66.70 cR from top of Chrl 8 lin 1 501 WfAF | WIAF-3440 

18 66.70 cR from top of Chrl 8 lin 1719 WIAF | WIAF-3658 

18 68.20 cR from top of Chrl 8 lin 2007 WIAF | WIAF-1471 

18 72.30 cR from top of Chrl 8 lin 3322 WIAF j WIAF-1775 

18 80.30 cR from top of Chrl 8 lin 1596 WIAF | WIAF-3535 

18 81 .60 cR from top of Chrl 8 lin 1697 WIAF j WIAF-3636 

18 81 .60 cR from top of Chrl 8 lin 1700 WIAF | WIAF-3639 

18 109.00 cR from top of Chrl 8 11 2918 WIAF | W'lAF-817 

18 202.8 cR from top of Chrl 8 lin 2436 WIAF j WIAF-21 

18 288.2 cR from top of Chrl 8 lin 989 WIAF | WIAF-1432 

18 288.2 cR from top of Chr18 lin 1016 WIAF | WIAF-4064 

18 321.0 cRfromtopof Chris lin 3358 WIAF j WIAF-181 1 

18 323.9 cR from top of Chrl 8 lin 2781' WIAF | WlAF-530 

1 8 337.2 cR from top of Chrl 8 lin 2093 WIAF | WIAF-1 1 2 

18 355.2 cR from top of Chris lin 910 WIAF j WIAF-1 187 

18 355.2 cR from top of Chrl 8 lin 911 WIAF j WIAF-1 188 

18 394.1 cR from top of ChrlS iin 2282 WIAF j WIAF-753 

18 454.4 cR from top of Chrl 8 lin 2109 WIAF | WIAF-21 0 

18 455.8 cR from top of Chrl 8 lin 2252 WIAF j WIAF-700 

18 465.8 cR from top of Chrl 8 lin 3029 WIAF j WIAF-929 
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19 88.00 cR from top of Chr19 tin 1628 WIAF | WIAF-3567 

19 89.8 cR from top of Chr19 link 3039 WIAF j WIAF-940 

19 95.40 cR from top of Chr19 lin 1723 WIAF | WIAF-3662 

19 97.60 cR from top of Chr19 lin 1376 WIAF j WIAF-3280 

5 19 97.90 cR from top of Chr19 lin 1668 WIAF j WIAF-3607 

1 9 99.80 cR from top of Chrl 9 lin 11 72 WIAF j WIAF-1 689 

19 100.30 cR from top of Chrl 9 11 1214 WIAF | WIAF-2101 

19 103.60 cR from top of Chrl 9 li 1964 WIAF j WIAF-3903 

19 104.90 cR from top of Chrl 9 II 2427 WIAF \ WIAF-7 

10 19 106.70cRfromtopof Chr19ll 1584 WIAF j WIAF-3523 

19 107.40 cR from top of Chrl 9 11 1487 WIAF | WIAF-3426 

19 109.90 cR from top of Chrl 9 li 3534 WIAF | WIAF-1 989 

19 109.90 cR from top of Chrl 9 li 1476 WIAF | WIAF-3414 

19 256.4 cR from top of Chrl 9 lin 2343 WIAF | WIAF-2574 

15 19 280.9 cR from top of Chrl 9 lin 3493 WIAF | WIAF-1 948 

19 285.1 cR from top of Chrl 9 lin 2888 WIAF | WIAF-786 

19 286.2 cR from top of Chrl 9 lin 2617 WIAF | WIAF-264 

19 290.4 cR from top of Chrl 9 lin 2731 WIAF | WIAF-439 

19 290.7 cR from top of Chrl 9 lin 2717 WIAF | WIAF-416 

20 19 324.0 cR from top of Chr19 lin 35.82 WIAF | WIAF.2037 

19 324.0 cR from top of Chrl 9 lin 3077 WIAF j WIAF-978 

19 325.6 cR from top of Chrl 9 lin 3854 WIAF | WIAF-2641 

19 331 .1 cR from top of Chrl 9 lin 2423 WIAF | WIAF-1 

1 9 331 .2 cR from top of Chrl 9 lin 2832 WIAF j WIAF-654 

25 1 9 341 .9 cR from top of Chrl 9 lin 3045 WIAF | WIAF-946 

19 349.3 cR from top of Chrl 9 lin 4591 HU-CHINA I 19-941 

19 349.3 cR from top of Chrl 9 lin 4592 HU-CHINA | 19-941-2 

1 9 349.3 cR from top of Chrl 9 lin 3040 WIAF | WIAF-941 

19 380.7 cR from top of Chrl 9 lin 3867 WIAF | WIAF-2662 

30 1 9 382.3 cR from top of Chrl 9 lin 3372 WIAF | WIAF-1 825 

1 9 382.3 cR from top of Chrl 9 lin 3373 WIAF j WtAF-1 826 

1 9 382.8 cR from top of Chr1 9 lin 3851 WIAF | WIAF-2637 

19 385.1 cR from top of Chrl 9 lin 3485 WIAF | WIAF-1 940 



19 4054 SHGC/AFFYMETRIX | SNPA-SHGC-35310 

35 19 4060 SHGC/AFFYMETRIX | SNPB-SHGC-35310 

19 3957 SHGC/AFFYMETRIX j SNPB-SHGC-9656 

19 3963 SHGC/AFFYMETRIX I SNP-SHGC-1 1607 

19 4068 SHGC/AFFYMETRIX | SNP-SHGC-1 3495 

19 3174 WIAF I WlAF-1488 

40 19 2038 WIAF | WIAF-1 61 8 

19 3239 WIAF | WIAF-1 649 

19 * 3253 WIAF | WIAF-1 671 

19 3787 WIAF | WIAF-2515 

19 1740 WIAF I WIAF-3679 

45 19 2952 WIAF j WIAF-851 

19 2985 WIAF | WIAF-885 

19 2986 WIAF | WIAF-886 

19 2993 WIAF j WIAF-893 

19 2994 WIAF I WIAF-894 

50 19 3023 WIAF I WIAF-923 
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3071 WIAF I WIAF-972 



20 7.10 cR from top of Chr20 link 1242 WIAF ] WIAF-2129 

20 8.20 cR from top of Chr20 link 2057 . WIAF | WIAF-1720 

5 20 9.30 cR from top of Chr20 link 1842 WIAF \ WIAF-3781 

20 9.40 cR from top of Chr20 link 1 232 WIAF j WIAF-21 1 9 

20 9.80 cR from top of Chr20 link 1 1 25 WIAF | WIAF-1 568 

20 9.80 cR from top of Chr20 link 1 1 26 WIAF j WIAF-1 569 

20 9.80 cR from top of Chr20 link 3344 WIAF ) WlAF-1797 

10 20 10.1 cR from top of Chr20 link 2856 WIAF j WIAF-749 

20 10.10 cR from top of Chr20 lin 2494 WIAF j WIAF-92 

20 14.7 cR from top of Chr20 link 2432 WIAF | WIAF-1 5 

20 22.00 cR from top of Chr20 lin 1880 WIAF j WIAF-3819 

20 23.2 cR from top of Chr20 link 3551 WIAF | WIAF-2006 

1 5 20 24,7 cR from top of Chr20 link 2745 WIAF | WIAF-464 

20 24.7 cR from top of Chr20 link 2746 WIAF j WIAF-465 

20 25.6 cR from top of Chr20 link 2851 WIAF | WIAF-730 

20 30.60 cR from top of Chr20 lin 3256 WIAF j WIAF-1 684 

20 32.60 cR from top of Chr20 lin 1570 WIAF | WIAF-3509 

20 20 32.60 cR from top of Chr20 lin 1572 WIAF | WIAF-3511 

20 35.10 cR from top of Chr20 lin 1650 WIAF | WlAF-3589 

20 36.80 cR from top of Chr20 lin 2040 WIAF j WlAF-1621 

20 39.90 cR from top of Chr20 lin 1 241 WIAF j WIAF-21 28 

20 41 .20 cR from top of Chr20 lin 2002 WIAF | WIAF-3941 

25 20 41 .40 cR from top of Chr20 lin 2000 WIAF | WIAF-3939 

20 41 .60 cR from top of Chr20 lin 1 225 . WIAF | WIAF-21 1 2 

20 41 .60 cR from top of Chr20 lin 2713 WIAF | WIAF-410 

20 41 .70 cR from top of Chr20 lin 1786 WIAF | WlAF-3725 

20 42,20 cR from top of Chr20 lin 1988 WIAF | WIAF-3927 

30 20 42.70 cR from top of Chr20 lin 3013 WIAF j WIAF-913 

20 47.80 cR from top of Chr20 lin 2887 WIAF 1 WIAF-785 

20 48.70 cR from top of Chr20 lin 1490 WIAF | WIAF-3429 

20 49 cM 4325 UWGC i 144 

20 53.00 oR from top of Chr20Jin 2896 WIAF | WIAF-794 

35 20 55.00 cR from top of Chr20 lin 1 1 81 WIAF | WIAF-1 71 1 

20 55.00 cR from top of Chr20 lin 2626 WIAF | WIAF-286 

20 55.40 cR from top of Chr20 lin 1758 WIAF j WIAF-3697 

20 62.40 cR from top of Chr20 lin 2009 WIAF | WIAF-1 481 

20 63.30 cR from top of Chr20 lin 3564 WIAF j WIAF-2019 

40 20 65.30 cR from top of Chr20 lin 3398 WIAF j WIAF-1 853 

20 74.00 cR from top of Chr20 lin 1099 WIAF [ WIAF-1 51 5 

20 74.00 cR from top of Chr20 lin 1875 WIAF j WIAF-3814 

20 82.10 cR from top of Chr20 lin 3514 WIAF j WlAF-1969 

20 82.50 cR from top of Chr20 lin 1537 WIAF j WIAF-3476 

45 20 82.80 cR from top of Chr20 lin 1978 WIAF j WlAF-3917 

20 86.00 cR from top of Chr20 lin 1745 WIAF | WIAF-3684 

20 88.30 cR from top of Chr20 lin 1507 WIAF | WlAF-3446 

20 89.5 cR from top of Chr20 link 2083 WIAF | WIAF-67 

20 91 .2 cR from top of Chr20 link 21 30 WIAF | WIAF-333 

50 20 96.50 cR from top of Chr20 lin 1998 WIAF | WIAF-3937 
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10 



15 



20 



25 



30 



35 



40 



45 



50 



20 
20 
20 
20 
20 
20 
20 
20 
20 
20 
20 
20 
20 
20 
20 
20 
20 
20 
20 
20 
20 
20 
20 



96.50 
106.6 
310.1 
316.2 
318.0 
318.9 
319.9 
320.2 
320.6 
334.7 
334.7 
334.7 
343.6 
343.6 
343.6 
343.6 



cR from top 
cR from top 
cR from top 
cR from top 
cR from top 
cR from top 
cR from top 
cR from top 
cR from top 
cR from top 
cR from top 
cR from top 
cR from top 
cR from top 
cR from top 
cR from top 



of Chr20 (in 
of Chr20 lin 
of Chr20 lin 
of Chr20 lin 
of Chr20 lin 
of Chr20 lin 
of Chr20 lin 
of Clir20 lin 
of Chr20 lin 
of Chr20 lin 
of Chr20 lin 
of Chr20 lin 
of Chr20 lin 
of Chr20 lin 
of Chr20 lin 
of Chr20 lin 



30 cR from top of Chr21 lin 
30 cR from top of ehr21 lin 
60 cR from top of Chr21 lin 
50 cR from top of Chr21 lin 
8 cR from top of Chr21 link 
50 cR from top of Chr21 lin 
10 cR from top of Chr21 lin 
30 cR from top of Chr21 lin 
20 cR from top of Chr21 lin 
30 cR from top of Chr21 lin 
30 cR from top of Chr21 lin 
00 cR from top of Chr21 lin 
40 cR from top of Chr21 lin 
,40 cR from top of Chr21 lin 
.30 cR from top of Chr21 lin 
.30 cR from top of Chr21 lin 
.30 cR from top of Chr21 lin 
30 cR from top of Chr21 lin 
21 59.60 cR from top of Chr21 lin 
21 59.60 cR from top of Chr21 lin 
21 59.60 cR from top of Chr21 lin 
21 59.60 cR from top of Chr21 lin 
21 59.60 cR from top of Chr21 lin 
21 59.60 cR from top of Chr21 lin 
21 59.60 cR from top of Chr21 lin 
21 1 1 9.1 cR from top of Chr21 lin 



21 11. 

21 11, 

21 17< 

21 30. 

21 31, 

21 32. 

21 35. 

21 35 

21 38 

21 38 

21 39 

21 45 

21 57 

21 57 

21 58 

21 58 

21 59 

21 59 



2971 
2501 
2283 
2441 
2628 
3365 
3540 
2082 
771 
4577 
763 
822 
2871 
2872 
2873 
2874 
4086 
4009 
4010 
4033 
3238 
3699 
3052 

1558 
1559 
1623 
1606 
2243 
1856 
1557 
1569 
3488 
3197 
1718 
1547 
3200 
3201 
1540 
1543 
1324 
1325 
4570 
689 
1166 
1746 
1748 
1802 
2999 
2206 



WIAF I WIAF-871 
WIAF 1 WIAF-100 
WIAF I WIAF-756 
WIAF I WIAF-26 
WIAF I WIAF-288 
WIAF i WIAF-1818 
WIAF I WIAF-1995 
WIAF I WIAF-42 
WIAF I WIAF-1402 
HU-CHINA I 20-1357 
WIAF I WIAF-1357 
WIAF I WIAF-4221 
WIAF 1 WlAF-769 
WIAF I WIAF-770 
WIAF I WIAF-771 
WIAF I WIAF-772 
SHGC/AFFYMETRIX 
SHGC/AFFYMETRIX | SNP-SHGC-2774 
SHGC/AFFYMETRIX | SNP-SHGC-2775 
SHGC/AFFYMETRIX 



SNP-SHGC-16962 



SNP-SHGC-6179 



WIAF 
WIAF 
WIAF 

WIAF 
WIAF 
WIAF 
WIAF 
WIAF 
WIAF 
WIAF 
WIAF 
WIAF 
WIAF 
WIAF 
WIAF 
WIAF 
WIAF 
WIAF 
WIAF 
WIAF 
WIAF 
HU-l 
WIAF 
WIAF 
WIAF 
WIAF 
WIAF 
WIAF 
WIAF 



CHINA 



WIAF-1648 
WIAF-2427 
wiAF-953 

WIAF-3497 
WIAF-3498 
WIAF-3562 
WIAF-3545 
WIAF-683 
WIAF-3795 
WIAF-3496 
WIAF-3508 
WIAF-1943 
WIAF-1 534 
WIAF-3657 
WIAF-3486 
WIAF-1 537 
WIAF-1 538 
WIAF-3479 
WIAF-3482 
WIAF-2214 
WIAF-2215 
I 21-899 
WIAF-1 199 
WIAF-1 678 
WIAF-3685 
WIAF-3687 
WIAF-3741 
WIAF-899 
WIAF.624 
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3 cR from top of Chr21 lin 
3 cR from top of Chr21 lin 
0 cR from top of Chr21 lin 
7 cR from top of Chr21 lin 



21 153. 
21 153. 
21 157. 
21 233. 
5 21 
21 
21 
21 
21 

10 21 
21 
21 
21 
21 

15 21 
21 
21 
21 
21 

20 21 
21 
21 
21 



25 X 1 .7 cR from top of ChrX llnkag 

X 7.0 cR from top of ChrX linkag 

X 7.4 cR from top of ChrX linkag 

X 1 1 .1 0 cR from top of ChrX link 

X 1 1 .10 cR from top of ChrX link 

30 X 1 5.0 cR from top of ChrX linka 

X 1 5.0 cR from top of ChrX linka 

X 1 5.7 cR from top of ChrX linka 

X 23.5 cR from top of ChrX linka 

X 28.10 cR from top of ChrX link 

35 X 28.10 cR from top of ChrX link 

X 30.1 0 cR from top of ChrX link 

X 41 ,4 cR from top of ChrX linka 

X 43.4 cR from top of ChrX linka 

X 43.4 cR from top of ChrX linka 

40 X 59.80 cR from top of ChrX link 

X 91 .6 cR from top of ChrX linka 

X 91.6 cR from top of ChrX linka 

X 91.8 cR from top of ChrX linka 

X 92.20 cR from top of ChrX link 

45 X 96,80 cR from top of ChrX link 

X 120.30 cR from top of ChrX lin 

X 1 87.70 cR from top of ChrX lin 

X 1 87.70 cR from top of ChrX lin 

X 232.1 cR from top of ChrX link 

50 X 282.4 cR from top of ChrX link 



3855 
3859 
2434 
3438 
3989 
4141 
4011 
4211 
4212 
4213 
4214 
3982 
4215 
4216 
4030 
4217 
4031 
3184 
1170 
1171 
3402 
3427 
1212 

2384 
2188 
2096 
1293 
1294 
3870 
3075 
2148 

773 
2573 
1244 
1833 
3303 
2803 
2804 
3486 
2608 
2609 
3408 
3417 
3548 

705 
3290 
3291 
2257 

761 



WIAF I WIAF-2643 

WIAF I WIAF-2648 

WIAF I WIAF-17 

WIAF I WIAF-1893 

SHGC/AFFYMETRIX 

SHGC/AFFYMETRIX 

SHGC/AFFYMETRIX 

SHGC/AFFYMETRIX 

SHGC/AFFYMETRIX 

SHGC/AFFYMETRIX 

SHGC/AFFYMETRIX 

SHGC/AFFYMETRIX 

SHGC/AFFYMETRIX 

SHGC/AFFYMETRIX 

SHGC/AFFYMETRIX 

SHGC/AFFYMETRIX 

SHGC/AFFYMETRIX 



WIAF 

WIAF 
WIAF 
WIAF 
WIAF 
WIAF 

WIAF 
WIAF 
WIAF 
WIAF 
WIAF 
WIAF 
WIAF 
WIAF 
WIAF 
WIAF 
WIAF 
WIAF 
WIAF 
WIAF 
WIAF 
WIAF 
WIAF 
WIAF 
WIAF 
WIAF 
WIAF 
WIAF 
WIAF 
WIAF 
WIAF 
WIAF 



WIAF- 1500 

WIAF-1682 
WIAF-1683 
WIAF-1857 
WIAF-1882 
WIAF-2099 



WIAF- 
WIAF- 
WIAF- 
WIAF- 
WIAF" 
WIAF- 
WIAF- 
WIAF- 
WIAF- 
WIAF- 
WIAF- 
WIAF- 
WIAF- 
WIAF- 
WIAF- 
WIAF- 
WIAF- 
WIAF- 
WIAF- 
WIAF- 
WIAF- 
WIAF- 
WIAF- 
WIAF- 
WIAF- 
WIAF- 



2675 

576 

137 

2180 

2181 

2669 

976 

440 

1404 

203 

2131 

3772 

1756 

572 

573 

1941 

254 

255 

1863 

1872 

2003 

1356 

1743 

1744 

707 

1350 



SNPA-SHGC-9556 

SNP-SHGC-13352 

SNP-SHGC-2811 

SNP-SHGC-51813 

SNP-SHGC-51844 

SNP-SHGC-51849 

SNP-SHGC-51852 

SNP-SHGC-51888 

SNP-SHGC-51907 

SNP.SHGC-51925 

SNP.SHGC-51941 

SNP-SHGC-51944 

SNP-SHGC-51951 
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CHROMOSOME LOCATION ASSAY ID SNP ID 



10 



15 



20 



25 



30 



35 



40 



45 



50 



X 


285.4 cR from top of ChrX 1 


nk 3030 


WIAF 1 


WIAF-930 


X 


290.1 cR from top of ChrX 1 


nk 


2072 


WIAF 1 


WIAF-2 


X 


294.0 cR from top of ChrX 1 


nk 


2778 


WIAF 1 


WIAF-522 


X 


301 .2 cR from top of ChrX 1 


nk 


2776 


WIAF 1 


WIAF-519 


X 


301 .3 cR from top of ChrX 1 


nk 


871 


WIAF 1 


WIAF-1098 


X 


301 .3 cR from top of ChrX 1 


nk 


872 


WIAF 1 


WIAF-1099 


X 


304.8 cR from top of ChrX 1 


nk 


3824 


WIAF 1 


WIAF-2589 


X 


31 5.6 cR from top of ChrX I 


nk 


3599 


WIAF 1 


WIAF-2269 


X 


31 5.6 cR from top of ChrX I 


nk 3600 


WIAF 1 


WIAF-2270 


X 


331 .3 cR from top of ChrX 1 


nk 3104 


WIAF 1 


WIAF-1005 


X 


Xp21.2, 32.550 Mb 




625 


KWOK 


1 Xpl 226-1 


X 


Xp21.2, 32.550 Mb 




624 


KWOK 


1 Xpl 226-2 


X 


Xq22, 105.900 Mb 




623 


KWOK 


1 Xql 136-1 


X 


Xq22, 106.300 Mb 




661 


KWOK 


1 Xq544-1 


X 


Xq22, 106.300 Mb 




660 


KWOK 


1 Xq544-2 


X 


Xq24, 117.624 Mb 




641 


KWOK 


1 Xq3562-1 


X 


Xq24, 117.685 Mb 




646 


KWOK 


1 Xq3655-2 


X 


Xq24, 117.754 Mb 




648 


KWOK 


1 Xq3656-1 


X 


Xq24, 117.754 Mb 




649 


KWOK 


1 Xq3656-2 


X 


Xq24, 117.754 Mb 




647 


KWOK 


1 Xq3656-3 


X 


Xq25 






590 


KWOK 


1 Xq3855-1 


X 


Xq25 






591 


KWOK 


1 Xq3858-1 


X 


Xq25, 


122.820 Mb 




588 


KWOK 


1 Xq3847-1 


X 


Xq25, 


124,300 Mb 




593 


KWOK 


1 Xq3868-1 


X 


Xq25, 


124.362 Mb 




654 


KWOK 


1 Xq3773-1 


X 


Xq25, 


124.456 Mb 




655 


KWOK 


1 Xq3774-1 


X 


Xq25, 


124.456 Mb 




656 


KWOK 


i Xq3774-2 


X 


Xq25, 


124.673 Mb 




592 


KWOK 


1 Xq3862-1 


X 


Xq25, 


124.860 Mb 




645 


KWOK 


1 Xq3570-1 


X 


Xq25, 


124.860 Mb 




644 


KWOK 


1 Xq3570-2 


X 


Xq25, 


:^ 24.860 Mb 




642 


KWOK 


I Xq3570-3 


X 


Xq25, 


124.860 Mb 




643 


KWOK 


1 Xq3570-4 


X 


Xq25, 


125.110 




621 


KWOK 


1 Xq38l3-1 


X 


Xq25, 


125.110 




622 


KWOK 


1 Xq3813-2 


X 


Xq25, 


126.091 Mb 




587 


KWOK 


1 Xq3846-1 


X 


Xq25, 


126.091 Mb 




586 


KWOK 


1 Xq3846-2 


X 


Xq25, 


126.257 




617 


KWOK 


1 Xq3705-1 


X 


Xq25, 


126.257 




618 


KWOK 


1 Xq3705-2 


X 


Xq25, 


126.296 Mb 




657 


KWOK 


1 Xq3804-1 


X 


Xq25, 


126.387 




619 


KWOK 


1 Xq3812-1 


X 


Xq25, 


126.387 




620 


KWOK 


1 Xq3812-2 


X 


Xq25, 


126.687 Mb 




589 


KWOK 


1 Xq3849-1 


X 


Xq25, 


126.846 Mb 




652 


KWOK 


1 Xq3699-1 


X 


Xq25, 


126.846 Mb 




653 


KWOK 


1 Xq3699-2 


X 


Xq25, 


126.954 Mb 




658 


KWOK 


1 Xq3811-1 


X 


Xq25, 


126.954 Mb 




659 


KWOK 


1 Xq3811-2 


X 


Xq25, 


127.053 Mb 




650 


KWOK 


1 Xq3698-1 


X 


Xq25, 


127.053 Mb 




651 


KWOK 


1 Xq3698-2 


X 


Xq26 






594 


KWOK 


1 Xq3871-1 


X 


Xq26, 131.172 Mb 




597 


KWOK 


1 Xq3879-1 
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ASSAY ID 


SNP ID 




X 


Xq26, 131.990 Mb 


634 


KWOK 


1 Xq3070-1 




X 


Xq26, 131.990 Mb 


635 


KWOK 


] Xq3070-2 




X 


Xq26, 136.541 Mb 


662 


KWOK 


I Xq3874-1 




X 


Xq26/ 136.631 Mb 


595 


KWOK 


1 Xq3875-1 


5 


X 


Xq26, 136.631 Mb 


596 


KWOK 


1 Xq3875-2 




X 


Xq26, 137.977 


608 


KWOK 


1 Xq3695-1 




X 


Xq26, 137.977 


609 


KWOK 


1 Xq3695-2 




X 


Xq26, 137.977 


611 


KWOK 


1 Xq3695-3 




X 


Xq26, 137.977 


610 


KWOK 


1 Xq3695-4 


10 


X 


Xq26, 138.062 


612 


KWOK 


1 Xq3696-1 




X 


Xq26, 138.062 


613 


KWOK 


1 Xq3696-2 




X 


Xq26, 138.062 


614 


KWOK 


1 Xq3696-3 




X 


Xq26, 138.062 


615 


KWOK 


1 Xq3696-4 




X 


Xq26, 138.062 


616 


KWOK 


1 Xq3696-5 


15 


X 


Xq27 


598 


KWOK 


1 Xq3885-1 




X 


Xq27 


599 


KWOK 


1 Xq3885-2 




X 


Xq27 


600 


KWOK 


1 Xq3886-1 




X 


Xq27 


601 


KWOK 


1 Xq3886-2 




X 


Xq27, 141.250 Mb 


631 


KWOK 


1 Xq2904-1 


20 


X 


Xq27, 141.250 Mb 


632 


KWOK 


1 Xq2904-2 




X 


Xq27, 141.250 Mb 


633 


KWOK 


1 Xq 2904-3 




X 


Xq27, 141.499 Mb 


663 


KWOK 


1 Xq3887-1 




X 


Xq27, 141.499 Mb 


664 


KWOK 


1 Xq3887-2 




X 


Xq27, 141.580 Mb 


602 


KWOK 


1 Xq3888-1 


25 


X 


Xq28 


. 603 


KWOK 


i Xq3555-1 




X 


Xq28 


604 


KWOK 


1 Xq3555-2 




X 


Xq28 


605 


KWOK 


1 Xq3555-3 




X 


Xq28 


606 


KWOK 


1 Xq3555-4 




X 


Xq28 


607 


KWOK 


1 Xq3555-5 


30 


X 


Xq28, 157.074 Mb 


585 


KWOK 


1 Xq3841-1 




X 


Xq28, 157.123 Mb 


583 


KWOK 


1 Xq3840-1 




X 


Xq28, 157.123 Mb 


584 


KWOK 


i Xq3840-2 




X 


Xq28, 157.939 Mb 


640 


KWOK 


1 Xq3476-1 




X 


Xq28, 157.939 Mb 


639 


KWOK 


1 Xq3476-2 


35 


X 


Xq28, 158.055 Mb 


637 


KWOK 


1 Xq 3449-1 




X 


Xq28, 158.059 Mb 


638 


KWOK 


1 Xq3471-1 




X 


Xq28, 158.237 Mb 


630 


KWOK 


1 Xq2816-1 




X 


Xq28, 158.265 Mb 


636 


KWOK 


1 Xq3274-1 




X 


Xq28, 158.490 Mb 


626 


KWOK 


1 Xq1452.1 


40 


X 


Xq28, 158.490 Mb 


627 


KWOK 


1 Xql 452-2 




X 


Xq28, 158.490 Mb 


628 


KWOK 


1 Xql 452-3 




X 


Xq28, 158.490 Mb 


629 


KWOK 


1 Xql 452-4 




X 




4099 


SHGC/AFFYMETRIX 




X 




2008 


WIAF 1 


WIAF-1472 


45 


X 




3271 


WIAF 1 


WIAF-1723 




X 




3272 


WIAF 1 


WIAF-1724 




X 




3469 


WIAF 1 


WIAF-1924 




X 




3602 


WIAF i 


WIAF-2274 




X 




3869 


WIAF 1 


WIAF-2666 


50 


X 




3036 


WIAF 


WIAF-936 



I SNP-SHGC-18945 
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CHROMOSOME 
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ASSAY ID SNP ID 




Y 






3930 


OEFNER 


1 M2 




Y 






3930 


OEFNER 


1 M2 




Y 






3931 


OEFNER 


1 M3 




Y 






3931 


OEFNER 


1 M3 


5 


Y 






3932 


OEFNER 


1 M4 




Y 






3933 


OEFNER 


1 M5 




Y' 






3933 


OEFNER 


1 M5 




Y 






3934 


OEFNER 


1 M6 




Y 






3935 


OEFNER 


1 M7 


10 


Y 






3936 


OEFNER 


1 M8 




. Y 






3937 


OEFNER 


1 M9 




Y 






3938 


OEFNER 


1 M10 




Y 






3939 


OEFNER 


1 Mil 




Y 






3940 


OEFNER 


1 M12 


15 


Y 






3941 


OEFNER 


1 M13 




V 

T 








UtrNtn 


^y1 1 A 

j IVI 1 H- 




Y 






3943 


OEFNER 


1 M15 




Y 






3944 


OEFNER 


1 M16 




Y 






3944 


OEFNER 


1 M16 


20 


. Y 






3945 


OEFNER 


i M17 




Y 






3946 


OEFNER 


1 M18 




Y 






3947 


OEFNER 


1 M19 




Y 






3948 


OEFNER 


1 M20 




Y 






3949 


OEFNER 


1 M21 


25 


Y 






3950 


OEFNER 


1 M22 




E. 


METHODS FOR REMOVING NUCLEIC ACID 



ABNORMAL BASE-PAIRING 



30 Provided herein is a method for removing a nucleic acid duplex 

containing one or more abnormal base-pairing in a population of nucleic 
acid duplexes, which method comprises: a) contacting a population of 
nucleic acid duplexes having or suspected of having a nucleic acid duplex 
containing one or more abnormal base-pairing with a mutant DNA repair 

35 enzyme or complex thereof, wherein the mutant DNA repair enzyme or 
complex thereof has binding affinity for the abnormal base-pairing in the 
duplex but has attenuated catalytic activity and whereby the nucleic acid 
duplex containing one or more abnormal base-pairing binds to the mutant 
DNA repair enzyme or complex thereof to form a binding complex; and b) 

40 removing the binding complex formed in step a) from the population of 
nucleic acid duplexes, thereby the nucleic acid duplex containing one or 
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more abnormal base-pairing is removed from the population of nucleic 
acid duplexes. 

In a specific embodiment, a population of nucleic acid duplexes 
comprise DNA:DNA, DNArRNA and RNArRNA duplexes. Preferably, the 
5 population comprises DNA:DNA duplexes. 

In another specific embodiment, the nucleic acid duplex to be 
removed from the population comprise a base-pair mismatch, a base 
insertion, a base deletion or a pyrimidine dimer. Preferably, the base-pair 
mismatch is a single base-pair mismatch. 

1 0 In still another specific embodinient, the population of nucleic acid 

duplexes is produced by an enzymatic amplification. Preferably, the 
population of nucleic acid duplexes is produced by a polymerase chain 
reaction or a reaction utilizing reverse transcription and subsequent DNA 
amplification of one or more expressed RNA sequences. 

1 5 The binding complex formed between the nucleic acid duplex 

containing one or more abnormal base-pairing and the mutant DNA repair 
enzyme or complex thereof can be removed from the population of 
nucleic acid duplexes by any methods known in the art. For example, the 
binding complex can be separated from the population by conventional 

20 separation methods such as electrophoresis, centrifugation, filtration and 
chromatograph. The separation can also be effected by affinity 
separation/purification, i.e., using moieties that bind proteins but not 
nucleic acids. For example, antibodies that bind proteins generally but 
not nucleic acids can be used, antibodies that specifically bind the mutant 

25 DNA repair enzyme or cornplex thereof can be used. In addition, the 

mutant DNA repair enzyme or complex thereof can be labelled and/or 

tagged and the separation can be effected through the labels or tags. 

F. METHODS FOR DETECTING AND LOCALIZING ABNORMAL BASE- 
PAIRING IN NUCLEIC ACID DUPLEX 

30 

Also provided herein is a method for detecting and localizing an 
abnormal base-pairing in a nucleic acid duplex by contacting a nucleic 
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acid duplex having or suspected of having an abnormal base-pairing with 
a mutant DNA repair enzyme or complex thereof, where the mutant DNA 
repair enzyme or complex thereof has binding affinity for the abnormal 
base-pairing in the duplex but has attenuated catalytic activity and 
5 whereby the nucleic acid duplex containing an abnormal base-pairing 
binds to the mutant DNA repair enzyme or complex thereof to form a 
binding complex; subjecting the nucleic acid duplex to hydrolysis with an 
exonuclease under conditions such that the binding complex formed in 
the first step blocks hydrolysis; and then determining the location within 

10 the nucleic acid duplex protected from the hydrolysis, thereby detecting 
and localizing the abnormal base-pairing in the nucleic acid duplex. 

In a specific embodiment, the nucleic acid duplex to be assayed is 
a DNA:DNA, a DNA:RNA or a RNA:RNA duplex. Preferably, the nucleic 
acid duplex to be assayed is a DNA:DNA. 

15 In another specific embodiment, the abnormal base-pairing to be 

detected and localized is a base-pair mismatch, a base insertion, a base 
deletion or a pyrimidine dimer. Preferably, the base-pair mismatch to be 
detected and localized is a single base-pair mismatch. 

Any exonucleases can be used in the present methods. For 

20 example, the exonucleases with the following Genbank Accession Nos. 
can be used: AF194116 {Escherichia coli exonuclease X), API 91741 
(Arabidopsis thallana exonuclease RRP41 (RRP41)), AF013497 
(Pyrococcus furiosus endo/exonuclease (fen-1)), AF058396 
(Chlamydophila caviae strain GPIC ssDNA-specific exonuclease (recJ)}, 

25 AF151105 (Homo sapiens 3'-5' exonuclease TREX1 mRNA), AF151108 
(Mus musculus 3'-5' exonuclease TREX2), API 511 07 (Homo sapiens 3'- 
5' exonuclease TREX2 mRNA), API 51 106 (Mus musculus 3'-5' 
exonuclease TREX1), AF083915 (Chilo iridescent virus exonuclease II 
homolog (EX02)), AF1 40550 (Salmonella typhimurium exonuclease VII 

30 (xseA)), AF1 34570 (Xenopus laevis exonuclease Exol (EXOI)), AF084974 
(Homo sapiens exonuclease I (EXOI)), AP030933 (Homo sapiens 

-162- 



wo 01/62968 



PCTAJSOl/00452 



exonuclease homolog RADl (RADl)), AF034258 (Caenorhabditis elegans 
exonuclease III homolog), AH006967 (Homo sapiens exonuclease I 
(EXOI) gene), AF091740 (Homo sapiens exonuclease la (EXOIa), 5174 
(Schizosaccharomyces pombe exonuclease I (exol), AF084514 (Mus 
5 musculus DNA repair exonuclease (Red)), AF084513 (Homo sapiens 
DNA repair exonuclease (RECI)), AF084512 (Homo sapiens DNA repair 
exonuclease (RECI), AF060479 (Homo sapiens exonuclease I (EXOI), 
U76424 (Lactococcus lactis), U57401 (Choristoneura fumiferana alkaline 
exonuclease), U58147 (Haemophilus ducreyi), U86134 (Saccharomyces 

10 cerevisiae exonuclease 1 (EXOI), U57963 (Erwinia chrysanthemi single- 
stranded DNA exonuclease (recJ) gene), M22592 (E.coli xth gene 
encoding exonuclease III), J02641 (E.coli sbcB gene encoding 
exonuclease I), L23927 (Escherichia coli exonuclease VIII (recE) 

Preferably, exonucleases that specifically cleave double-stranded 

1 5 nucleic acids, but not single-stranded nucleic acids, are used in the 
present methods. Also preferably, nuclease BAL-31, exonuclease 111, 
Mung Bean exonuclease or Lambda exonuclease is used. 
G. LABELLING OF MUTANT DNA REPAIR ENZYMES 

Conjugates, such as fusion proteins and chemical conjugates, of 

20 the mutant DNA repair enzyme with a protein or peptide fragment (or 

plurality thereof) that functions, for example, to facilitate affinity isolation 
or purification of the mutant enzyme, attachment of the mutant enzyme . 
to a surface, or detection of the mutant enzyme are provided. The 
conjugates can be produced by chemical conjugation, such as via thiol 

25 linkages, but are preferably produced by recombinant means as fusion 
proteins. In the fusion, protein, the peptide or fragment thereof is linked 
to either the N-terminus or C-terminus of the mutant enzyme. In 
chemical conjugates the peptide or fragment thereof may be linked 
anywhere that conjugation can be effected, and there may be a plurality 

30 of such peptides or fragments linked to a single mutant enzyme or to a 
plurality thereof. 
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1 . Conjugation 

Conjugation can be effected by any method known to those of skill 
in the art. As described below/conjugation can be effected by chemical 
means, through covalent, ionic or any other suitable linkage. 
5 a. Fusion proteins 

Fusion proteins are provided herein. A fusion protein contains: a) 
one or a plurality of mutant DNA repair enzymes and b) at least one 
protein or peptide fragment that facilitates, for example: i) affinity 
isolation or purification of the fusion protein; ii) attachment of the fusion 

1 0 protein to a surface; or iii) detection of the fusion protein, or any 
combination thereof. 

The facilitating agent is selected to perform the desired purpose, 
such as (i) - (iii), and is linked a mutant DNA repair enzyme such that the 
resulting conjugate retains the mutant DNA repair enzyme property and 

1 5 also processes the property(ies) of the facilitating agent. For example, 
the facilitating agent can be a protein or peptide fragment, such as a 
protein binding peptide, including but not limited to an epitope tag or an 
IgG binding protein, a nucleotide binding protein, such as a DNA or RNA 
binding protein, a lipid binding protein, a polysaccharide binding protein, 

20 and a metal binding protein or fragments thereof that possess the 
requisite desired facilitating activity. 

Such facilitating agents can be designed, screened or selected 
according to the methods known in the art. The screening or selection 
process begins, for example, with nucleic acid encoding a particular 

25 protein or peptide to be used in the fusion protein, and screened or 
selected for its specific binding partner. Alternatively, the screening or 
selection process can start with a specific molecule that can be used in 
the subsequent isolation/purification, attachment or detection, and screen 
or select for a particular protein or peptide sequence to be used in the 

30 fusion protein that can specifically bind to the pre-selected molecule. 
The conventional technique of random screening of natural 
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products can be used in screening and selecting a protein or peptide 
sequence and its specific binding partner. In addition, numerous 
strategies can be used for preparing proteins having new binding 
specificities. These new approaches generally involve the synthetic 
5 production of large numbers of random molecules followed by some 
selection procedure to identify the molecule of interest. For example, 
epitope libraries have been developed using random polypeptides 
displayed on the surface of filamentous phage particles. The library is 
made by synthesizing a repertoire of random oligonucleotides to generate 

10 all combinations, followed by their insertion into a phage vector. Each of 
the sequences is separately cloned and expressed in phage, and the 
relevant expressed peptide can be selected by finding those phage that 
bind to the particular target. The phages recovered in this way can be 
amplified and the selection repeated. The sequence of the peptide is 

15 decoded by sequencing the DNA (See e.g., Cwiria et al., Proc. Natl. 
Acad, ScLr USA, 87:6378-6382 (1990); Scott et al., Science, 
249:386-390 (1990); and Devlin et al., Science, 249:404-406 (1990). 

Another approach involves large arrays of peptides that are 
synthesized in parallel and screened with acceptor molecules labelled with 

20 fluorescent or other reporter groups. The sequence of any effective 
peptide can be decoded from its address in the array {See e.g., Geysen 
et al., Proc. Natl. Acad. ScL, USA, 81:3998-4002 (1984); Maeji et al., J. 
Immunol. Met, 146 :83-90 (1992); and Fodor et al., Science, 
251:767-775 (1991). 

25 Combinatorial approaches can also be employed. For example, in 

one exemplary approach, combinatorial libraries of peptides are 
synthesized on resin beads such that each resin bead contains about 20 
pmoles of the same peptide. The beads are screened with labeled 
acceptor molecules and those with bound acceptor are searched for by 

30 visual inspection, physically removed, and the peptide identified by direct 
sequence analysis (Lam et al.. Nature, 354 :82-84 (1991)). Another 
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useful combinatory method for identification of peptides of desired 
activity is that of Houghten et al. (see, e.g.,, Nature, 354 :84-86 (1991)). 
For hexapeptides of the 20 natural amino acids, 400 separate libraries are 
synthesized, each with the first two amino acids fixed and the remaining 
5 four positions occupied by all possible combinations. An assay, based on 
competition for binding or other activity, is then used to find the library 
with an active peptide. Twenty new libraries are then synthesized and 
assayed to determine the effective amino acid in the third position, and 
the process is reiterated in this fashion until the active hexapeptide is 
10 defined. 

b. Chemical conjugation 
To effect chemical conjugation herein, the targeting agent is linked 

via one or more selected linlcers or directly to the targeted agent. 

Chemical conjugation must be used if the targeted agent is other than a 
1 5 peptide or protein, such a nucleic acid or a nop-peptide drug. Any 

means known to those of skill in the art for chemically conjugating 

selected moieties may be used, 

1) Heterobifunctional cross-linking reagents 
Numerous heterobifunctional cross-linking reagents that are used to 
20 form covalent bonds between amino groups and thiol groups and to 

introduce thiol groups into proteins, are known to those of skill in this art 

(see, e.g., the PIERCE CATALOG, ImmunoTechnology Catalog & 

Handbook, 1992-1993, which describes the. preparation of and use of 

such reagents and provides a commercial source for such reagents; see, 
25 also, e.g., Cumber et al. (1992) Bioconjugate Chem. 5':397-401; Thorpe 

etal. (1987) Cancer Res. 47:5924-5931; Gordon etal. Proc. 

Natl. AcadScL 54:308-312; Walden etal. (1986) J. Mol. CelUmmunol. 

2:191-197; Carlsson etal. (1978) Biochem. J. 173: 723-737; Mahan et 

al. {-[987) Anal. Biocherh. 762:163-170; Wawryznaczak a/. (1992) Br. 
30 J. Cancer 65:361-366; Fattom etal. (1992) Infection & Imnnun. 50:584- 

589), These reagents may be used to form covalent bonds between the 

.166- 



wo 01/62968 



PCT/USOl/00452 



mutant analyte binding enzyme and the facilitating agent. These reagents 
include, but are not limited to: N-succinimidyl-3-(2- 
pyridyldithio)propioriate (SPDP; disulfide linker); sulfosuccinimidyl 6-[3-(2- 
pyridyldithio)propionamido]hexanoate (sulfo-LC-SPDP); 
5 succinimidyloxycarbonyl-ar-methyl benzyl thiosulfate (SMBT, hindered 
disulfate linker); succinimidyl 6-I3-{2-pyridyldithio) propionamido]- 
hexanoate (LC-SPDP); sulfosuccinimidyl 4-(N-maleimidomethyl)cyclo- 
hexane-1-carboxylate (sulfo-SMCC); succinimidyl 3-(2-pyridyldithio)- 
butyrate (SPDB; hindered disulfide bond linker); sulfosuccinimidyl 2-(7- 

10 azido-4-methylcoumarin-3-acetamide) ethyl-1 ,3'-dithiopropibnate (SAED); 
sulfo-succinimidyl 7-azido-4-methylcoumarin-3-acetate (SAMCA); 
sulfosuccinimidyl 6-[alpha-methyl-alpha-(2-pyridyldithio)toluamido]- 
hexanoate (sulfo-LC-SMPT); 1 ,4-di-[3'-(2'-pyridyldithio)propion- 
amido]butane (DPDPB); 4-succinimidyloxycarbonyl-a-methyl-a-(2- 

15 pyridylthioltoluene (SMPT, hindered disulfate linker);sulfosuccinlmidyl6[a- 
methyl-a-(2-pyridyldithio)toluamidolhexanoate (sulfo-LC-SMPT); /tj- 
maleimidobenzoyl-N-hydroxysuccinimide ester (MBS); m-maleimidoben- 
zoyl-N-hydroxysulfosucGinimide ester (sulfo-MBS); N-succinimidyl(4- 
iodoacetyDaminobenzoate (SIAB; thioether linker); sulfosuccinimidyl(4- 

20 iodoacetyDamino benzoate (suifo-SlAB); succinimidy!4(p-maleimido- 

phenyDbutyrate (SMPB); sulfosuccinimidyl4-(p-maleimidophenyl)butyrate 
(sulfo-SMPB); azidobenzoyi hydrazide (ABH). 

Other heterobifunctional cleavable cross-linkers include, N- 
succinimidyl (4-iodoacetyl)-aminobenzoate; sulfosuccinimydil (4- 

25 iodoacetyD-aminobenzoate; 4-succinimidyl-oxycarbonyl-a-(2-pyridyldithio)- 
toluene; sulfosuccinimldyl-6- [a-methyl-a-{pyridyldithiol)-toluamido] 
hexanoate; N-succinimidyl-3-(-2-pyridyldithio) - proprionate; succinimidyl 
6l3{-(-2-pyridyldithio)-proprionamido] hexanoate; sulfosuccinimidyl 6[3(-(- 
2-pyridyldithio)-propionamido] hexanoate; 3-(2-pyridyldlthio)-propionyl 

30 hydrazide, Ellman's reagent, dichiorotriazinic acid, S-(2-thiopyridyl)-L- 

cysteine. Further exemplary Afunctional linking compounds are disclosed 
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in U.S. Patent Nos. 5,349,066, 5,618,528, 4,569,789, 4,952,394, and 
5,137,877. 

2) Exemplary Linkers 
Any linker known to those of skill in the art for preparation of 
5 conjugates may be used herein. These linkers are typically used in the 
preparation of chemical conjugates; peptide linkers may be incorporated 
into fusion proteins. 

Linkers can be any moiety suitable to associate the mutant DNA 
repair enzyme and the facilitating agent. Such linkers and linkages 

10 include, but are not limited to, peptidic linkages, amino acid and peptide 
linkages, typically containing between one and about 60 amino acids, 
more generally between about 10 and 30 amino acids,chemical linkers, 
such as heterobifunctional cleavable cross-linkers, including but are not 
limited to, N-succinimidyl (4-iodoacetyl)-aminobenzoate, sulfosuccinimydil - 

1 5 {4-iodoacetyl)-aminobenzoate, 4-succinimidyl-oxycarbonyl-a- (2- 

pyridyldithio)toluene, sulfosuccinimidyl-6- [a-methyl-a-(pyridyldithiol)- 
toluamido] hexanoate, N-succinimidyl-3-('2-pyridyldithio) - proprionate, 
succinimidyl 6[3(-(-2-pyridyldithio)-proprionamido] hexanoate, 
sulfosuccinimidyl 6[3(-(-2-pyridyldithio)-propionamido] hexanoate, 3-{2- 

20 pyridyldithio)-propionyl hydrazide, Eflman's reagent, dichlorotriazinic acid, 
and S-(2-thiopyridyl)-L-cysteine. Other linkers include, but are not limited 
to peptides and other moieties that reduce stearic hindrance between the 
mutant analyte binding enzyme and the facilitating agent, intracellular 
enzyme substrates, linkers that increase the flexibility of the conjugate, 

25 linkers that increase the solubility of the conjugate, linkers that increase 
the serum stability of the conjugate, photocleavable linkers and acid 
cleavable linkers. 
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Other exemplary linkers and linkages that are suitable for 
chemically linked conjugates include, but are not limited to, disulfide 
bonds, thioether bonds, hindered disulfide bonds, and covalent bonds 
between free reactive groups, such as amine and thiol groups. These 
5 bonds are produced using heterobifunctional reagents to produce reactive 
thiol groups on one or both of the polypeptides and then reacting the thiol 
groups on one polypeptide with reactive thiol groups or amine groups to 
which reactive maieimido groups or thiol groups can be attached on the 
other. Other linkers include, acid clea\/able linkers, such as 

10 bismaleimideothoxy propane, acid labile-transferrin conjugates and adipic 
acid diihydrazide, that would be cleaved in more acidic intracellular 
compartments; cross linkers that are cleaved upon exposure to UV or 
visible light and linkers, such as the various domains, such as Ch1, Ch2, 
and Ch3, from the constant region of human IgG^ (see, Batra et al. (1993) • 

15 Molecular Immunol. 50:379-386). In some embodiments, several linkers 
may be included in order to take advantage of desired properties of each 
linker. 

Chemical linkers and peptide linkers may be inserted by covalently 
coupling the linker to the mutant DNA repair enzyme and the facilitating 

20 agent. The heterobifunctional agents, described below, may be used to 
effect such covalent coupling. Peptide linkers may also be linked by 
expressing DNA encoding the linker and TA, linker and targeted agent, or 
linker, targeted agent and TA as a fusion protein. Flexible linkers and 
linkers that increase solubility of the conjugates are contemplated for use, 

25 either alone or with other linkers are also contemplated herein. 

a) Acid cleavable, photo cleavable and heat 
sensitive linkers 

Acid cleavable linkers, photocleavable and heat sensitive linkers 
may also be used, particularly where it may be necessary to cleave the 
30 targeted agent to permit it to be more readily accessible to reaction. Acid 
cleavable linkers include, but are not limited to, bismaleimideothoxy 
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propane; and adipic acid dihydrazide linkers (see, e.g., Fattom et ai 
(1992) Infection & Immun. 60:584-589) and acid labile transferrin 
conjugates that contain a sufficient portion of transferrin to permit entry 
into the intracellular transferrin cycling pathway (see, e.g., Welhoner et 
5 a/. (1991) J. Biol. Chem. 266:4309-4314). 

Photocleavable linkers are linkers that are cleaved upon exposure to 
light (see, e.g., Goldmacher ef a/. (1992) Bioconj. Chem. 3:104-107, 
which linkers are herein incorporated by reference), thereby releasing the 
targeted agent upon exposure to light. Photocleavable linkers that are 

10 cleaved upon exposure to light are known (see, e.g., Hazum et al. (1981) 
in Pept, Proc. Eur. Pept Symp., 16th, Brunfeldt, K (Ed), pp. 105-110, 
which describes the use of a nitrobenzyl group as a photocleavable 
protective group for cysteine; Yen etaL (1989) Makromol. Chem 190:69- 
82, which describes water soluble photocleavable copolymers, including • 

1 5 hydroxypropylmethacrylamide copolymer, glycine copolymer, fluorescein 
copolymer and methylrhodamine copolymer; Goldmacher etaL (1992) 
Bioconj. Chem. 3:104-107, which describes a cross-linker and reagent 
that undergoes photolytic degradation upon exposure to near UV light 
(350 nm); and Senter et al. (1985) Photochem. Photobiol 42:23^ -237, 

20 which describes nitrobenzyloxycarbonyl chloride cross linking reagents 
that produce photocleavable linkages), thereby releasing the targeted 
agent upon exposure to light. Such linkers would have particular use in 
treating dermatological or ophthalmic conditions that can be exposed to 
light using fiber optics. After administration of the conjugate, the eye or 

25 skin or other body part can be exposed to light, resulting in release of the 
targeted moiety from the conjugate. Such photocleavable linkers are 
useful in connection with diagnostic protocols in which it is desirable to 
remove the targeting agent to permit rapid clearance from the body of the 
animal. 
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b) Other linkers for chemical conjugation 
Other linkers, include trityl linkers, particularly, derivatized 

trityl groups to generate a genus of conjugates that provide for 
release of therapeutic agents at various degrees of acidity or alkalinity. 
5 The flexibility thus afforded by the ability to preselect the pH range at 
which the therapeutic agent will be released allows selection of a linker 
based on the known physiological differences between tissues in need of 
delivery of a therapeutic agent (see, e.g., U.S. Patent No. 5,612,474). 
For example, the acidity of tumor tissues appears to be lower than that of 
10 normal tissues. 

c) Peptide linkers 

The linker moieties can be peptides. Peptide linkers can be 
employed in fusion proteins and also In chemically linked conjugates. The 
peptide typically a has from about 2 to about 60 amino acid residues, for 

1 5 example from about 5 to about 40, or from about 1 0 to about 30 amino 
acid residues. The length selected will depend upon factors, such as the 
use for which the linker is included. 

The proteinaceous ligand binds with specificity to a receptor{s) on 
one or more of the target cell(s) and is taken up by the target celKs). In 

20 order to facilitate passage of the chimeric ligand-toxin into the target cell, 
it is presently preferred that the size of the chimeric ligand-toxin be no 
larger than can be taken up by the target cell of interest. Generally, the 
size of the chimeric ligand-toxin will depend upon its composition. In the 
case where the chimeric ligand toxin contains a chemical linker and a 

25 chemical toxin (i.e., rather than proteinaceous one), the size of the ligand 
toxin is generally smaller than when the chimeric ligand-toxin is a fusion 
protein. Peptidic linkers can conveniently be encoded by nucleic acid and 
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incorporated in fusion proteins upon expression in a host cell, such as 
£. coI'l 

Peptide linkers are advantageous when the facilitating agent is 
proteinaceous. For example, the linker moiety can be a flexible spacer 
5 amino acid sequence, such as those known in single-chain antibody 
research. Examples of such known linker moieties include, but are not 
limited to, peptides, such as {Gly^Ser)^ and {Serf„Gly)n, in which n is 1 to 
6, preferably 1 to 4, more preferably 2 to 4, and m is 1 to 6, preferably 1 
to 4, more preferably 2 to 4, enzyme cleavable linkers and others. 

10 Additional linking moieties are described, for example, in Huston et 

aL, Proc. Natl. Acad. ScL U.S.A. 35:5879-5883, 1988; Whitlow, M., et 
al., Protein Engineering 6:989-995, 1993; Newton ef a/., Biocfiemistry 
55:545-553, 1996; A. J. Cumber et ai., Bioconj. Chem. 5:397-401, 
1992; Ladurner ef aA, J. l\/loL Biol. 275:330-337, 1997; and U.S. Patent. - 

15 No. 4,894,443. In some embodiments, several linkers-may be included in 
order to take advantage of desired properties of each linker. 
2. Selection of facilitating agents 

Any agent that facilitates detection, immobilization, or purification 
of the conjugate is contemplated for use herein. For chemical conjugates 
20 any moiety that has such properties is contemplated; for fusion proteins, 
the facilitating agent is a protein, peptide or fragment thereof that is 
sufficient to effect the facilitating activity. 

a. Protein binding moieties 
The conjugate contains a protein binding moiety, particularly a 
25 protein binding protein, peptide or effective fragment thereof. Its specific 
ipinding partner can be proteins or peptides generally, a set of proteins or 
peptides or mixtures thereof, or a particular protein or peptide. Any 
protein-protein interaction pair known to those ofskill in the art is 
contemplated. For exanriple, the protein-protein interaction pair can be • 
30 enzyme/protein or peptide substrate, antibody/protein or peptide antigen, 
receptor/protein or peptide ligand, etc. Any protein-protein interaction 
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pair can be designed, screened or selected according to the methods 
known in the art (See generally. Current Protocols in Molecular Biology 
(1998) § 20, John* Wiley & Sons, Inc.). Examples of such methods for 
identifying protein-protein interactions include the interaction trap/two- 
5 hybrid system and the phage-based expression cloning. 

1 ) interaction trap/two-hybrid system 
Interacting proteins can be identified by a selection or screen in 
which proteins that specifically interact with a target protein of interest 
are isolated from a library. One particular approach to detect interacting 

10 proteins is the two-hybrid system or interaction trap (See generally. 

Current Protocols in Molecular Biology (1998) § 20,1 .-20.2., John Wiley 
& Sons, Inc.), which uses yeast as a "test tube" and transcriptional 
activation of a reporter system to identify associating proteins. 

In the two-hybrid system, a yeast vector such as the plasmid 

1 5 pEG202 or a related vector can be used to express the probe or "bait" 
protein as a fusion to the heterologous DNA-binding protein LexA. Many 
proteins, including transcription factors, kinases, and phosphatases, can 
be used as bait proteins. The major requirements for the bait protein are 
that it should not be actively excluded from the yeast nucleus, and it 

20 should not possess an intrinsic ability to strongly activate transcription. 
The plasmid expressing the LexA-fused bait protein can be used to 
transform yeast possessing a dual reporter system responsive to 
transcriptional activation through the Z.exA operator. 

In one such example, the yeast strain EGY48 containing the 

25 reporter plasmid pSH18-34 can be used. In this case, binding sites for 
LexA are located upstream of two reporter genes. In the EGY48 strain, 
the upstream activating sequences of the chromosomal LEU2 gene, 
which is required in the biosynthetic pathway for leucine (Leu), are 
replaced with Z.exA operators (DNA binding sites). PSH18-34 contains a 

30 LexlK operator-/acZ fusion gene. These two reporters allow selection for 
transcriptional activation by permitting selection for viability when cells 
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are plated on medium lacking Leu, and discrimination based on color 
when the yeast is grown on medium containing Xgal. 

The EGY48/PSH 18-34 transformed with a bait is first characterized 
for its ability to express protein, growth on medium lacking Leu, and for 
5 the level of transcriptional activation of lacZ, A number of alternative 
strains, plasmids, and strategies can be employed if a bait proves to have 
an unacceptably high level of background transcriptional activation. 

In an interactor hunt, the stain EGY48/PSH 18-34 containing the 
bait expression plasmid is transformed, preferably along with carrier DNA, 

10 with a conditionally expressed library made in a suitable vector such as 
the vector pJG4-5. This library uses the inducible yeast GAL1 promoter 
to express proteins as fusions to an acidic domain ("acid blob") that 
functions as a portable transcriptional activation motif (act) and to other 
useful moieties. Expression of library-encoded proteins is induced by 

15 plating transformants on medium containing galactose (Gal), so yeast 
cells containing library proteins that do not interact specifically with the 
bait protein will fail to grow in the absence of Leu. Yeast cells containing 
library proteins that interact with the bait protein will form colonies within 
2 to 5 days, and the colonies will turn blue when the cells are streaked 

20 on medium containing Xgal. The DNA from interaction trap positive 
colonies can be analyzed by polymerase chain reaction (PGR) to 
streamline screening and detect redundant clones in cases where many 
positives are obtained in screening. The plasmids can be isolated and 
characterized by a series of tests to confirm specificity of the interaction 

25 with the initial bait protein. 

An alternative way of conducting an interactor hunt is to mate a 
strain that expresses the bait protein with a strain that has been 
pretransformed with the library DNA, and screen the resulting diploid cells 
for interactors (Bendixen et al., Nucl. Acids. Res., 22:1778-1779 (1994); 

30 and Finley and Brent, Proc. Natl. Sci. U.S.A., 91:12980-12984 (1994)). 
This "interaction mating" approach can be used for any interactor hunt, 
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and is particularly useful in three special cases. The first case is when 
more than one bait will be used to screen a single library. Interaction 
mating allows several interactor hunts with different baits to be 
conducted using a single high-efficiency yeast transforrriation with library 
5 DNA. This can be a considerable savings, since the library transformation 
is one of the most challenging tasks in an interactor hunt. The second 
case is when a constitutively expressed bait interferes with yeast 
viability. For such baits/ performing a hunt by interaction mating avoids 
the difficulty associated with achieving a high-efficiency library 

10 transformation of a strain expressing a toxic bait. Moreover, the actual 
selection for interactors will be conducted in diploid yeast, which are 
more vigorous than haploid yeast and can better tolerate expression of 
toxic proteins. The third case is when a bait cannot be used in a 
traditional interactor hunt using haploid yeast stains because it activates - 

15 transcription of even the least sensitive reporters. In diploids the 

reporters are less sensitive to transcription activation than they are in 
haploids. Thus, the interaction mating hunt provides an additional 
method to reduce background from transactivating baits. 

The interaction trap/two-hybrid system and the identified protein- 

20 protein interaction pairs have been successfully used (see, e.g., Bartel et 
al.. Using the two-hybrid system to detect protein-protein interactions. In 
Cellular Interactions in Development: A Practical Approach, (D.A; Hartley, 
ed.) pp. 153-179, Oxford University Press, Oxford (1993); Bartel et al., A 
protein linkage map of Escherichia coli bacteriophage T7, Nature Genet, 

25 12:72-77 (1996); Beridixen et al., A yeast mating-selection scheme for 
detection of protein-protein interactions, NucL Acids. Res., 22:1778- 
1779 (1994); Breeden and Nasmyth, Regulation of the yeast HO gene., 
Cold spring Harbor Symp. Quant Biol, 50:643-650 (1985); Brent and 
Ptashne, A bacterial repressor protein or a yeast transcriptional terminator 

30 can block upstream activation of a yeast gene, Nature, 5/2:61 2-61 5 
(1984); Brent etaL, A eukaryotic transcriptional activator bearing the 

.175- 



wo 01/62968 



PCTAJSOl/00452 



DNA specificity of a prokaryotic repressor. Cell, 45:729-736 (1985); 
Chien et al., The two-hybrid system: A method to identify and clone 
genes for proteins that interact with a protein of interest, Proc. Natl. 
Acad. Sci. U.S.A., 55:9578-9582 (1991); Chiu et aL, RAPT1, a 
5 mammalian homolbg of yeast Tor, interacts with the FKBP12/rapamycin 
complex, Proc. Nat Acad. Sci., U.S.A., 57:12574-12578 (1994); Colas 
et al.. Genetic selection of peptide aptamers that recognize and inhibit 
cyclin-dependent kinase 2., Nature, 550:548-550 (1996); Durfee et al.. 
The retinoblastoma protein associates with the protein phosphatase type 

10 1 catalytic subunit. Genes & Dev., 7:555-569 (1993); Estojak et al., 
Correlation of two-hybrid affinity data with in vitro measurements, MoL 
Cell. Biol., 75:5820-5829 (1995); Fearon et al., Karyoplasmic interaction 
selection strategy: A general strategy to detect protein-protein interaction 
in mammalian cells, Proc. Nat, Acad. Sci. U.S.A., 55:7958-7962 (1992); - 

1 5 Fields and Song, A novel genetic system to detect protein-protein 

interaction. Nature, 540:245-246 (1989); Finley and Brent, Interaction 
mating revels binary and ternary connections between Drosophila cell 
cycle regulators, Proc. Natl. Sci. U.S.A., 57:12980-12984 (1994); Gietz 
et al.. Improved method for high-efficiency transformation of intact yeast 

20 cells, NucL Acids. Res., 20:1425 (1992); Golemis and Brent, Fused 
protein domains inhibit DNA biding by LexA, MoL Cell Biol., 72:3006- 
3014 (1992); Gyuris et al., Cdil, a human G1 and S-phase protein 
phosphatase that associates with Cdkl, Cell, 75:791-803 (1993); Kaiser 
et al., A-, Methods in Yeast Genetics, a Cold Spring Harbor Laboratory 

25 Course Manual, pp. 135-136. Cold Spring Harbor- Laboratory Press, Cold 
Spring Harbor, N.Y. (1994); Kolonin and Finley, Jr., Targeting cyclin- 
dependent kinases in Drosophila with peptide aptamers, Proc. Natl. Acad. 
Sci. U.S.A., /n press (1998); Licitra and Liu, A three-hybrid system for 
detecting small ligand-prptein receptor interactions, Proc. Nat Acad. Sci. 

30 U.S.A., 55:12817-12821 (1996); Ma and Ptashne, A new class of yeast 
transcriptional activators, Cell, 57:113-119 (1987); Ma and Ptashne, 
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Converting an eukaryotic transcriptional inhibitor into an activator, Cell, 
55:443-446 (1988); Osborne et a!.. The yeast tribrid system: Genetic 
detection of transphosphorylated ITAM-SH2 interactions, Bio/Technology, 
75:1474-1478 (1995); Ruden et al., Generating yeast transcriptional 
5 activators containing no yeast protein sequences, Nature, 350/426-430 
(1991); Samson et al., Gene activation and DNA binding by Drosophila 
Ubx and abd-A proteins, Cell, 57:1045-1052 (1989); Stagljar et al., Use 
of the two-hybrid system and random sonicated DNA to identify the 
interaction domain of a protein, BioTechniques, 27:430-432 (1996); 

10 Vasavada et al., A contingent replication assay for the detection of 

protein-protein interactions in animal cells, Proc. Nat Acad. ScL U.S.A., 
5S: 10686- 10690 (1991); Vojtex et al., Mammalian Ras interacts directly 
with the serine/threonine kinase Raf, Cell, 74:205-214 (1993); Watson et 
al.. Vectors encoding alternative antibiotic resistance for use in the yeast - 

15 two-hybrid system, BioTechniques, 27:255-259 (1996); West et al., 
Saccharomyces cerevisiae GAL10 divergent promoter region: Location 
and function of the upstream activator sequence UASG, MoL Cell Biol., 
4:2467-2478 (1984); and Yang et al., Protein-peptide interactions 
analyzed with the yeast two-hybrid system, NacL Acids Res., 25:1 1 52- 

20 1 156 (1995)) and can be used in the present system. 

2) Phage-based expression cloning 
Interaction cloning (also known as expression cloning) is a 
technique to identify and clone genes that encode proteins that interact 
with a protein of interest, or "bait" protein. Phage-based interaction 

25 cloning requires a gene encoding the bait protein and an appropriate 

expression library constructed in a bacteriophage expression vector, such 
as ^Qt^ 1 (See generally. Current Protocols In Molecular Biology (1 998) § 
20.3, John Wiley & Sons, Inc.). The gene encoding the bait protein is 
used to produce recombinant fusion protein in f. coll. The cDNA is 

30 radioactively labeled with ^^P. A recognition site for a protein kinase such 
as the cyclic adenosine 3',5'-phosphate (cAMP)-dependent protein 
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kinase (Protein kinase A; PKA) is introduced into the recombinant fusion 
protein to allow its enzymatic phosphorylation by the kinase and lA- 
"P]ATP. 

In one example, the procedure involves a fusion protein containing 
5 bait protein and glutathione-S-transferase (GST) with a PKA site at the 
junction between them. The labeled protein is subsequently used as a 
probe to screen a A bacteriophage-derived cDNA expression library, which 
expresses /ff-galactosidase fusion proteins that contain in-frame gene 
fusions. The phages lyse cells, form plaques, and release fusion proteins 

10 that are adsorbed onto nitrocellulose membrane filters. The filters are 
blocked with excess nonspecific protein to eliminate nonspecific binding 
and probed with the radiolabeled bait protein. This procedure leads 
directly to the isolation of genes encoding the interacting protein, 
bypassing the need for purification and microsequencing or for antibody 

1 5 production. 

The phage-based interaction cloning system and the identified 
protein-protein interaction pairs have been successfully employed (Blanar 
et al.. Interaction cloning: Identification of a helix-loop-helix zipper protein 
that interacts with c-Fos, Science, 255:1014-1018 (1992); Carr and 

20 Scott, Blotting and band-shifting: Techniques for studying protein-protein 
interactions, Trends Biochem. ScL, 77:246-249 (1992); Chapline et al., 
Interaction cloning of protein kinase C substrates, J. Bio/. Chem., 
268:6858-6861 (1993); Hoeffler et al.. Identification of multiple nuclear 
factors that interact with cyclic AMP response element-binding protein 

25 and activation transcription factor-2 by protein interactions, Mol. 
Endocrinol., 5:256-266 (1991); Kaelin et al.. Expression cloning of a 
cDNA encoding a retinoblastoma-binding protein with E2F-like properties, 
Cell, 70:351-364 (1992); Lester et al., Cloning and characterization of a 
novel A-klnase anchoring protein: AKAP220, association with testicular 

30 peroxisomes, J, BioL Chem., 27/:9460-9465 (1996); Lowenstein et al.. 
The SH2 and SH2 domain-containing protein GRB2 links receptor tyrosine 
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kinase to ras signaling, Cell, 70:431-442 (1992); Margolis et al.. High- 
efficiency expression/cloning of epidermal growth factor-receptor-binding 
proteins with src homology 2 domains, Proc. Natl, Acad. Sci. U.S.A., 
55:8894-8898 (1992); Skolnik et al., Cloning of PI 3 kinase-associated 
• 5 p85 utilizing a novel method of expression/cloning of target proteins for 
receptor tyrosine kinases, Cell, 55:83-90 (1991); and Stone et a!., 
Interaction of a protein phosphatase with an Arabidopsis serine-threonine 
receptor kinase. Science, 266:793-795 (1994)) and can be used in the 
present system. 

1 0 3) Detection of protein-protein interactions 

Surface plasmon resonance (SPR) can be used to verify the protein- 
protein interactions identified by other systems such as the interaction 
trap/two-hybrid system and the phage-based expression cloning systems 
(See generally. Current Protocols In Molecular Biology (1998) § 20,4, 

1 5 John Wiley & Sons, Inc.). This is an in vitro technique based on an 
optical phenomenon, called SPR, that can simultaneously detect 
interactions between unmodified proteins and directly measure kinetic 
parameters of the interaction. 

SPR devices are commercially available. The BIAcore instrument 

20 (BIAcore) is presently preferred herein. This instrument contains sensing 
optics, an automated sample delivery system, and a computer for 
instrument control, data collection, and data processing. Experiments are 
performed on disposable chips.. In practice, a ligand protein is 
immobilized on the chip while buffer continuously flows over the surface. 

25 The sensing apparatus monitors changes in the angle of minimum 

reflectance from the interface that result when a target protein associates 
with the ligand protein. Molecular interactions can be directly visualized 
(on the computer monitor) in real time as the optical response is plotted 
against time. This response is measured in resonance units (RUs, where 

30 1000 RUs = 1 ng protein/mm^). 

The SPR system has been successfully used (see, e.g., 
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BioSupplyNet Source Book, BioSupplyNet, Plainview, N.Y., and Cold 
Spring Harbor Laboratory Press, Cold Spring Harbor, N.Y. (1999); Feng et 
a!., Functional binding between Gyff and the LIM domain of Ste5 is 
required to activate the IVIEKK Stall, Cur. BioL, 5:267-278 (1998); Field 
5 et al., Purification of RAS-responsive adenylyl cyclase complex from 
Sacchariomyces cerevisiae by use of an epitope addition method, MoL 
Cell. BloL, 5:2159-2165 (1988); Phizicky and Fields, Protein-protein 
interactions: Methods for detection and analysis, Microbiol. Rev., 53:94- 
123 (1995); Tyers et a!., Comparison of the Saccharomyces cerevisiae 

10 G1 cyclins: Cln3 may be an upstream activator of CInl, Cln2, and other 
cyclins, Ei\/IBO J., 7 7:1773-1784 (1993)) and the identified protein- 
protein interaction pairs can be used in the present system, 
b. Epitope tags 
The facilitating agent can be any moiety, particularly a protein, 

1 5 peptide or effective fragment thereof that is specifically recognized by an 
antibody. In these embodiments, the conjugate contains an epitope tag 
that is specifically recognized by a set of antibodies or by a particular 
antibody. Any epitope/antibody pair can be used in the present system 
(See generally. Current Protocols in Molecular Biology ( 1 998) 10.15, 

20 John Wiley & Sons, Inc.). The following Table 3 provides exemplary 
epitope tags and illustrates certain properties of several commonly used 
epitope tag systems. . 

Table 3. Exemplary epitope tag systems 



Epitope 


Peptide 


SEQ 
ID 


Antibody 


Reference 


FLAG 


AspTyrLysAspAspAspLys 


1 


4E11 


Prickett^ 


HA 


TyrProTyrAspValPRoAspTyrAla 


2 


12Ca5 




HA1 


CysGlnAspLeuProGlyAsnAspAsnSerThr 


3 


mouse MAb 


Nagelkerken^ 


c-Myc 


GluGlnLysLeulleSerGluGiuAspLeu 


4 


9E10 




6-His 


HisHisHisHisHisHis 


5 


BAbCO* 




AU1 


AspThrTyrArgTyrlle • 


6 


BAbCO 




EE 


GluTyrMetProMetGlu 


7 


anti-EE 


Tolbert* 
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Epitope 


Peptide 


SEQ 
ID 


Antibody 


Reference 


T7 


AlaSerMetThrGiyGlyGlnGlnMetGlyArg 


8 


Invitrogen 


Chen^ 
Tseng^ 


4A6 


SerPheProGlnPheLysProGlnGlulle 


9 


4A6 


Rudiger^ 




LysGlyPheSerTyrPheGlyGIuAspLeuMetPro 


10 


anti-PKC6 


Olah^ 


B 


GInTyrProAlaLeuThr 


11 


D11, FIO 


Wang^ 


gE 


GInArgGlnTyrGlyAspValPheLysGlyAsp 


12 


383 


Grose'° 


Tyi 


GluVaiHisThrAsnGlnAspProLeuAsp 


13 


882, TYG5 


Bastin^^ 



1 . Prickett et al., BioTechniques, 7161:580-584 (1989) 

2. Xie et al., Endocrinology, 139(1 1) :4563-4567 (1 998) 

1 0 3. Nagelkerke et al., Electrophoresis, 18:2694-2698 (1 997) 

4. Tolbert and Lameh, J. Neurochem., 70:1 13-1 19 (1 998) 

5. Chen and Katz, BioTechniques, 25(1) :22-24 (1998) 

6. Tseng and Verma, Gene, 169:287-288 (1996) 

7. Rudiger et al., BioTechniques, 23(1) :96-97 (1997) 
15 8. Olah et al., Biochem,, 221:94-102 (1994) 

9. Wang et al., Gene, 169(1) :53-58 (1 996) 

10. Grose, U.S. Patent No. 5,710,248 

11. Bastin et al., (Viol. Biochem, Parasitology, 77:235-239 (1996) 
Invitrogen, Sigma, Santa Cruz Biotech 

20 For example, in one embodiment, the selected epitope tag is the 6- 

His tag. Vectors for constructing a fusion protein containing the 6-His 
tag and reagents for isolating or purifying such fusion proteins are 
commercially available. For example, the Poly-His gene fusion vector 
from Invitrogen, Inc. (Carlsbad, CA) includes the following features: 1) 

25 high-level regulated transcription for the trc promoter; 2) enhanced 

translation efficiency of eukaryotic genes in E.co/i; 3) the LacO operator 
and the Lacf repressor gene for transcriptional regulation in any E coll 
system; N-terminal Xpress epitope for easy detection with an Anti-Xpress 
antibody; and 4) enterokinase cleaving site for removal of the fusion tag. 

30 The fusion protein can be purified by nickel-chelating agarose resin, and 
the purified fusion protein can be coated onto a microtiter plate pre- 
coated with nickel {e.g., Reacti-Blnding meta chelate polystyrene plates, 
Pierce) for diagnostic usage. 
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In addition, the fusion protein containing the 6-His tag can be 
isolated or purified using the His MicroSpin Purification Module or HisTrap 
Kit from Amersham Pharmacia Biotech, Inc. The His MicroSpin 
Purification Module provides fifty MicroSpin columns prepacked with 
5 nickel-charged Chelating Sepharose Fast Flow, The module enables the 
simple and rapid screening of large numbers of small-scale bacterial 
lysates for the analysis of putative clones and optimization of expression 
and purification conditions. Each column contains 50 //I bed volume, 
enough to purify > 100 /yg his-tagged fusion protein, from up to 400 //I 

10 of His-tagged fusion protein sample, e.g., crude lysate and purification 
intermediates. The HisTrap Kit is designed for rapid, mild affinity 
purification of histidine-tagged fusion proteins in a single step. The high 
dynamic capacity of HiTrap Chelating enables milligrams of protein to be 
purified in less than 15 minutes at flow rates of up to 240 column 

1 5 volumes per hour. The high capacity is maintained after repeated use 
ensuring cost-effective, reproducible purifications. The Kit includes three 
HiTrap Chelating columns and buffer concentrates to perform F10-12 
purifications with a syringe. The anti-His antibody from Amersham 
Pharmacia Biotech, Inc. is an lgG2 subclass of monoclonal antibody 

20 directed against 6 Histidine residues. The antibody is unconjugated to 
offer the flexibility of detection with a secondary antibody conjugated 
with either horseradish peroxidase or alkaline phosphatase. The antibody 
provides high sensitivity with low background. 

Further examples of epitope tagging can be found in Kolodziej and 

25 Young, Epitope tagging and protein surveillance. Methods EnzymoL, 
194:508-519 (1991). Methods for preparing and using other such tags 
and other such tags similarly can be used in the methods and products 
provided herein. 
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c. IgG binding proteins 
In other embodiments, the conjugate contains an IgG binding 
protein, wh!ch, for example provides a means for selective binding of the 
conjugate. Any IgG binding protein/IgG pair can be used in the present 
5 system. Protein A and Protein G are suitable facilitating. Any Protein A or 
Protein G can be used in the present system. 

For example, the following nucleotide sequences can be used for 
amplifying and constructing Protein A or Protein G fusion proteins: 
E04365 (Primer for amplifying IgG binding domain AB of protein A); 
10 E04364 (Primer for amplifying IgG binding domain AB of protein A); 

E01756 (DNA sequence encoding subunit which can bind IgG of protein 
A like substance); I\/I74187 (Cloning vector pKP497 (cloning, screening, 
fusion vector) encoding an IgG-bindIng fusion protein from protein A 
analogue (ZZ) and beta-Gal' (lacZ) genes). In addition, several Protein A 
1 5 gene fusion vectors such as pEZZ 1 8 and pRIT2T are commercially 
available (Amersham Pharmacia Biotech, Inc.). 

1) pEZZ 18 Protein A gene fusion vector 
pEZZ 18 Protein A gene fusion vector can be used for rapid 
expression of secreted fusion proteins and their one-step purification 
20 using IgG Sepharose 6FF. The phagemid pEZZ 1 8 contains the proteins 
A signal sequence and two synthetic "Z" domains based on the "B" IgG 
binding domain of Protein A (Lowenadler., et al.. Gene, 58:87 (1987); 
and Nilsson., et al., Prot Engineering, 1:107 (1987)). Proteins are 
expressed as fusions with the "ZZ" peptide and secreted into the aqueous 
25 culture medium under the direction of the protein A signal sequence. 
They are easily purified using IgG Sepharose 6FF to which the "ZZ" 
domain binds tightly. Because of its unique folding properties, the 14 
kDa "ZZ" peptide has little effect on folding of the fusion partner into a 
native conformation. 
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Expression 

Expression is controlled by the /acUV5 and protein A promoters 
and is not inducible. Elements of the protein A gene provide the 
ATG and ribosome-binding sites. Stop codons must be provided by 
5 the insert. 

Sequencing 

The Ml 3 Universal Sequencing Primer is used for double-stranded 
and single-stranded sequencing. A protocol for production of 
single-stranded DNA is provided with the vector. 
10 Cloning 

Inserts containing a stop codon will yield white colonies when 
grown on media containing X-gal. 

Host(s) 

E. coli strains carrying a lac deletion but capable of a- 
15 complementation of /acZ'. 

Selectable marker(s) 
Plasmid confers resistance to ampicillin. 

Amplification 

Amplification, though not necessarily required can be included. 

20 2) pRIT2T Protein A gene fusion vector 

The pRIT2T Protein A gene fusion vector (available from 
Pharmacia) can be used for high-level expression of intracellular fusion 
proteins, pRIT2T, a derivative of pRIT2 (Nilsson., et al., EMBO J., 
4:1075 (1985)), contains the IgG-binding domains oi staphylococcal 

25 protein A which permits rapid affinity purification of fusion proteins on 
IgG Sepharose 6 FF. Thermo-inducible expression of the fusion protein is 
achieved in a suitable £. coll host strain which carries the temperature- 
sensitive repressor c/857 (N4830-1) {Zabeau and Stanley, EMBO J., 
VMM (1982)). 
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Induction 

The i^Pp' promoter is induced by shifting the growth temperature 
from 30°C to 42°C for 90 minutes. 

Expression 

5 Genes inserted into the MCS are expressed from the A right 

promoter (Pr) as fusions with the IgG-binding domains of 
staphylococcal protein A. A portion of the A cm gene, fused to the 
IgG-binding domain, supplies the ATG start codon. Since no signal 
sequence is provided, the protein remains intracellular. Protein A 
10 gene transcription and translation termination signals are provided. 

Fusion protein can be purified on IgG Sepharose 6FF (17-0969-01). 
The protein A earner protein is -30 kDa, 
Host(s) 

E. CO// N4830-1/N99cr. Supplied with E. co// N4830-1 which 
1 5 contains the temperature-sensitive c/857 repressor. 

Selectable marker(s) 
Plasmid confers resistance to ampicillin. 

3) The IgG Sepharose 6 fast flow system 
The Protein A and Protein G fusion protein can be isolated or 
20 purified by affinity binding with IgG, such as the IgG Sepharose 6 Fast 
Flow System (Amersham Pharmacia Biotech, Inc.). The IgG Sepharose 6 
Fast Flow System includes IgG coupled to the highly cross-linked 6% 
agarose matrix Sepharose 6 Fast Flow, and is designed for the rapid 
purification of Protein A and Protein A fusion conjugates. The system 
25 binds at least 2 mg Protein A/ml drained gel with flow possible rates of 
300 cm/hr at 1 bar (14.5 psi, 0.1 MPa) In an XK 50/30 column 
(Lundstrom et al.. Biotechnology and Bioengirieering, 36:1056 (1990)). 
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d. /?-galactosidase fusion proteins 

The pMC1871 fusion vector (commercially available from 
Pharmacia, see, also Shapira et aL Gene 25'J\ (1983); Casadaban et aL 
Methods Enzymol. 100:293 (1983)) for production of enzymatically 
5 active ;S-galactosidase hybrid proteins for gene expression or functional 
studies. Vector pMC1871 is derived from pBR322 and contains a 
promoterless lacZ gene, which also lacks a ribosome-binding site and the 
first eight non-essential N-terminal amino acid codons. Its unique Sma I 
site allows fusions to the N-terminal part of the yff-galactosidase gene. 

1 0 Insertion of a gene into the E coli lacZ gene results in the production of a 
hybrid protein, whose presence can be readily detected by following its 
galactosidase activity (Miller, J.H., in Experiments in Molecular Qener. 
(Cold Spring Harbor, N.Y,) (1972); Nielsen etaL Proc. Natl. Acad. Sci. 
U.S.A., 50:5198 (1983)). Hybrid proteins can then be easily purified by 

15 affinity chromatography (Germino et a/. Proc. Natl. Acad. Sci. U.S.A., 81: 
4692 (1984)). Multiple cloning sites flanking the lacZ gene permit its 
excision as a BamH I, Sal I, Pst I or EcoR I gene cassette. If lacZ is 
excised as an EcoRI cassette, a portion of its 3'-end will be deleted. The 
resulting ^-galactosidase protein (a-donor) will be functional if the C- 

20 terminus of the /?-galactosidase protein (a-acceptor) is available through 
intercistronic complementation. 

Expression 

Inserts cloned into the unique Sma I site give fusion proteins with 
the N-terminal part of /ff-galactosidase. Insert must contain a promoter, 
25 ATG and ribosome-binding site. 

Host(s) 

E. coil strains carrying a lac deletion. 

Selectable markerfs) 
Plasmid confers resistance to 1 5 //g/ml tetracycline. 
30 GenBank Accession Number L08936. 
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e. Nucleic acid binding moieties 
In another embodiment the conjugate includes a nucleotide binding 
protein, peptide or effective fragment thereof as a facilitating agent. The 
specific binding partner can be nucleotide sequences generally, a set of 
5 nucleotide sequences or a particular nucleotide sequence. Any protein- 
nucleotide interaction pair can be used in the present system. For 
example, the protein-nucleotide interaction pair can be protein/DNA or 
protein/RNA pairs, or a combination thereof. Protein-nucleotide 
interaction pairs can be designed, screened or selected according to the 

10 methods known in the art {See generally. Current Protocols in Molecular 
Biology (1998) § 12, John Wiley & Sons, Inc.). Examples of such 
methods for identifying protein-nucleotide interactions include the gel 
mobility shift assay, methylation and uracil interference assay, DNase I 
footprint analysis, AgX^ 1 expression library screening and rapid separation - 

15 of protein-bound DNA from free DNA using nitrocellulose filters. 

1 ) DNA binding proteins 
The conjugate can contain a DNA binding protein and its specific 
binding partner can be DNA molecules generally, a set of DNA molecules 
or a particular sequence of nucleotides. Any DNA binding protein can be . 

20 used in the present system. For example, the DNA binding protein can 
bind to a single-stranded or double-stranded DNA sequence, or to an A-, 
B- or Z-form DNA sequence. The DNA binding sequence can also bind to 
a DNA sequence that is involved in replication, transcription, DNA repair, 
recombination, transposition or DNA structure maintenance. The DNA 

25 binding sequence can further be derived from a DNA binding enzyme 
such as a DNA polymerase, a DNA-dependent RNA polymerase, a 
DNAase, a DNA ligase, a DNA topoisomerase, a transposase, a DNA 
kinase, or a restriction enzyme. 

Any DNA binding sequence/DNA sequence pair can be designed, 

30 screened or selected according to the methods known in the art including 
methods described in Section L.2. above. 
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The following Table 4 illustrates certain properties of several DNA 
binding sequence/DNA sequence pair systems. 



Table 4. Examples of DNA binding sequer)ce/DNA sequence binding 
pairs 



DNA binding 
sequence 


DNA binding 
sequence motif 


DNA sequence 


Reference 

(U.S. Patent No.) 


NF-ATp 

(SEQ ID NO. 14) 


T lymphocyte 

DNA-bindIng 

protein 


GCCCAAAGAGGAA 
AATTTGTTTCATAC 
AG (SEQ ID NO. 15) 


5,656,452 


Max 

(SEQ ID NO. 16) 


helix-Joop-helix 
zipper protein 


CACGTG 


5,693,487 


Chicken Lung 140 
Kd Protein 




Z-DNA 


5,726,050 


EGR1, EGR2, GLI, 
Wilm's tumor 
gene, Spl , 
Hunchback, 
Kruppel, ADR1 and 
BrLA 


Zinc finger 
proteins 


GACC, GCAC 


5,789,538 


LIL-Stat protein 


Stat 

family of 

transcription 

factors 


TTNCNNAGA, 
TTCCTGAGA 


5,821,053 


Egr 

(SEQ ID NO. 17) 


zinc finger protein 


CGCCCCCGC 


5,866,325 


SI -3 protein (SEQ 
ID NO. 18) 


zinc finger protein 


CATRRWWG 


5,905,146 



25 2) RNA binding proteins 

In another preferred embodiment, the conjugate can contain an 
RNA binding protein and its specific binding partner can be RNA 
generally, a set of RNA molecules or a particular sequence of 
ribonucleotides. Any RNA binding protein can be used in the present 

30 system. For example, the RNA binding protein can bind to a single- 
stranded or double-stranded RNA, or to rRNA, mRNA or tRNA. The RNA 
binding protein may specifically bind to a RNA that is involved in reverse 
transcription, transcription, RNA editing, RNA splicing, translation, RNA 
stabilization, RNA destabilization, or RNA localization. The RNA binding 
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protein can be derived from or be an RNA binding enzyme such as a RNA- 
dependent DNA polymerase, a RNA-dependent RNA polymerase, a 
RNase, a RNA ligase, a RNA maturase, or a ribosome. 

Other RNA recognition sequence or binding motifs that can be used 
5 in the present system include the zinc-finger motif, the Y=box, the KH 
motif, AUUUA, histone, RNP motif (Ul), arginine-rich motif (ARM or 
PRE), double-stranded RNA binding motifs (IRE) and RGG box (APR) (U.S. 
Patent Nos. 5,834,184, 5,859,227 and 5,858,675). The RNP motif is a 
90-100 amino acid sequence that is present in one or more copies in 

10 proteins that bind pre mRNA, mRNA, pre-ribosomal RNA and snRNA. 
The consensus sequence and the sequences of several exemplary 
proteins containing the RNP motif are provided in Burd and Dreyfuss, 
Science, 265 :615-621 (1994); Swanson et al.. Trends Biochem. Sci., 
13:86 (1988); Bandziulis et aL, Genes Dev., 3:431 (1989); and Kenan et - 

15 al., Trends Biochem. Sci., 16:214 (1991). The RNP consensus motif 
contains two short consensus sequences RNP-1 and RNP-2. Some RNP 
proteins bind specific RNA sequences with high affinities (dissociation 
constant In the range of 10'®-10'^^ M), Such proteins often function in 
RNA processing reactions. Other RNP proteins have less stringent 

20 sequence requirements and bind less strongly (dissociation constant 
about 10-^-10-^ M) (Burd & Dreyfuss, ElVIBOJ., 13:1197 (1994)). 

A second characteristic RNA binding motif found in viral, phage 
and ribosomal proteins is an arginine-rich motif (ARM) of about 10-20 
amino acids. RNA binding proteins having this motif include the HIV Tat 

25 and Rev proteins. Rev binds with high affinity disassociation constant 
(10'^ M) to an RNA sequence termed RRE, which is found in all HIV 
mRNAs (Zapp et al., Nature, 342:714 (1989); and Dayton et al., Science, 
246:1 625 (1 989)). Tat binds to an RNA sequence termed TAR with a 
dissociation constant of 5X10'^ M (Churcher et al., J. (Viol. Biol., 230 :90 

30 (1993)). For Tat and Rev proteins, a fragment containing the 

arginlne-rlch motif binds as strongly as the Intact protein. In other RNA 
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binding proteins with ARM motifs, residues outside the ARM also 
contribute to binding. 

The double-stranded RNA-binding domain (dsRBD) exclusively 
binds double-stranded RNA or RNA-DNA. A dsRBD motif includes a 
5 region of approximately 70 amino acids which includes basic residues and 
contains a conserved core sequence with a predicted (T-helical structure. 
The dsRBD motif is found in at least 20 known or putative RNA-binding 
proteins from different organisms. There are two types of dsRBDs; Type 
A, which is homologous along its entire length with the defined 
10 consensus sequence, and Type B, which is more highly conserved at its 
C terminus than Its N terminus. These domains have been functionally 
delineated in specific proteins by deletion analysis and RNA binding 
assays (St Johnston, et al., Proc, Natl. Acad. Sci., 89:10979-10983 
(1992)). 

15 Any RNA binding sequence/RNA sequence pair can be designed, 

screened or selected according to the methods known in the art including 
the methods described in Section L.2, above and the methods, such as 
those described in U.S. Patent Nos. 5,834,184 and 5,859,227, and in 
SenGupta et a!., A three-hybrid system to detect RNA-protein interactions 

20 in vivo, Proc. Nat. Acad. Sci. U.S.A., 93:8496-8501 (1996)). 

For example, U.S. Patent No. 5,834,184 describes a method of 
screening a plurality of polypeptides for RNA binding activity. The 
method includes the steps of: (1) culturing a library of procaryotic cells 
that constitute a library, and (2) detecting expression of the reporter gehe 

25 in a cell from the library, the expression indicating that the cell comprises 
a polypeptide having RNA binding activity. The cells contain at least one 
vector that contains a first DNA segment that encodes a fusion protein of 
a prokaryotic anti-terminator protein having anti-terminator activity linked 
in-frame to the test polypeptide, which varies among the cells in the 

30 library, that is operably linked to a second DNA segment. The second 
DNA segment contains a promoter, an RNA recognition sequence foreign 
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to the anti-terminator protein, a transcription termination site and a 
reporter gene. The termination site blocks transcription of the reporter 
gene in the absence of a protein with anti-termination activity and affinity 
for the RNA recognition sequence. If the test polypeptide has specific 
5 affinity for the recognition sequence, it binds via the polypeptide to the 
RNA recognition sequence of a transcript from the second DNA segment 
thereby inducing transcription of the second DNA segment to proceed 
through the termination site to the reporter gene resulting in expression of 
the reporter gene, 

10 U.S. Patent No. 5,859,227 describes methods for identifying 

possible binding sites for RNA binding proteins in nucleic acid molecules, 
and confirming the identity of such prospective binding sites by detection 
of interaction between the prospective binding site and RNA binding 
proteins. These methods .involve identification of possible binding sites 

15 for RNA binding proteins, by either searching databases for untranslated 
regions of gene sequences or cloning untranslated sequences using a 
single specific primer and an universal primer, followed by confirmation 
that the untranslated regions in fact interact with RNA binding proteins 
using the RNA/RBP detection assay. Genomic nucleic acid can further be 

20 screened for putative binding site motifs in the nucleic acid sequences. 
Information about binding sites that are confirmed in the assay then can 
be used to redefine or redirect the nucleic acid sequence search criteria, 
for example, by establishing or refining a consensus sequence for a given 
binding site motif. 

25 SenGupta et a!., Proc. Nat Acad. ScL U.S.A., 93:8496-8501 

(1996) describes.a yeast igenetic method to detect and analyze 
RNA-protein interactions in which the binding of a bifunctional RNA to 
each of two hybrid proteins activates transcription of a reporter gene in 
vivo {see also Wang et al., Genes & Dev., 10:3028-3040 (1996)). 

30 SenGupta et al. demonstrate that this three-hybrid system enables the 
rapid, phenotypic detection of specific RNA-protein interactions. As 
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examples, SenGupta et al. use the binding of the iron regulatory protein 1 
(IRP1) to the iron response element (IRE), and of HIV trans-activator 
protein (Tat) to the HIV trans-activation response element (TAR) RNA 
sequence. The three-hybrid assay relies only on the physical properties of 
5 the RNA and protein, and not on their natural biological activities; as a 
result, it may have broad application in the identification of RNA-binding 
proteins and RNAs, as well as in the detailed analysis of their 
interactions. 

The following Table 5 illustrates* certain properties of several RNA 
10 binding sequence/RNA sequence pair systems. 

Table 5. Examples of RNA binding sequence/RNA sequence pairs 



RNA binding 
sequence 


RNA binding 
sequence motif 


RNA sequence 


Reference (U.S. 
Patent No.) 


BINDR 


double-stranded 
RNA-binding 


double-stranded RNA 
poly(rl) and poly (rC) 


5,858,675 


Protein extract from 
SH-SY5Y cells 


5' untranslated ' 
region (UTR) 


UTR of Gluti (SEQ 
ID NO. 19); 5' UTR 
of (HMG,CoA Red) 
(SEQ ID NO. 20); 5' 
UTR of human 
C4b-binding a chain 
(SEQ ID NO. 21); 5' 
UTR of human CD45 
(SEQ ID NO. 22) 


5,859,227 



3) Preparation of nucleic acid binding proteins 
Extracts prepared from the isolated nuclei of cultured cells are 

20 functional in accurate in vitro transcription and mRNA processing {See 
generally. Current Protocols in Molecular Biology (1998) § 12.1., John 
Wiley & Sons, Inc.). Thus,, such extracts can be used directly for 
functional studies and as the starting material for purification of the 
proteins involved in these processes. To prepare nuclear extracts, tissue 

25 culture cells are collected, washed, and suspended in hypotonic buffer. 
The swollen cells are homogenized and nuclei are pelleted. The 
cytoplasmic fraction is removed and saved, and nuclei are resuspended in 
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a low-salt buffer. Gentle dropwise addition of a high-salt buffer then 
releases soluble proteins from the nuclei (without lysing the nuclei). 
Following extraction, the nuclei are removed by centrifugation, the 
nuclear extract supernatant is dialyzed into a moderate salt solution, and 
5 any precipitated protein is removed by centrifugation. 

The nuclear and cytoplasmic extraction procedure (see, e.g., 
Dignam et al., 1983, Nucl. Acids. Res. 11:1475-1489 (Accurate 
transcription initiation by RNA polymerase II in a soluble extract from 
Isolated mammalian nuclei); Dignam, et al., 1983, Methods Enzymol. 

10 101:582-598 (Eukaryotic gene transcription with purified components); 
Krainer, et aL, 1984, Ce// 36:993-1005 (Normal and mutant human ^- 
globin pre-mRNAs are faithfully and efficiently spliced in vitro); Lue, et al, 
1987, Proc. Natl. Acad. ScL U.S.A. 84:8839-8843 (Accurate initiation at 
RNA polymerase II promoters in extracts from Saccharomyces cerevisiae); 

15 Manley, et al., 1980, Proc. Natl. Acad. Sci. U.S.A. 77:3855-3859 (DNA- 
dependent transcription of adenovirus genes in a soluble whole-cell 
extract); Weil, et al., 1979, J. Biol. Chem. 254:6163-6173 (Faithful 
transcription of eukaryotic genes by RNA polymerase III in systems 
reconstituted with purified DMA templates); and Weil, et al., 1979, Cell 

20 18:469-484 (Selective and accurate initiation of transcription at the Ad2 

major late promoter in a soluble system dependent on purified RNA 

polymerase II and DNA)) and the identified protein-DNA interaction pairs 

can be used in the present system. 

4) Assays for identifying nucleic acid binding 
25 proteins 

a) l\/lobility shift DNA-binding assay 

The DNA-binding assay using nondenaturing polyacrylamide gel 

electrophor(8sis (PAGE) provides a simple, rapid, and extremely sensitive 

method for detecting sequence-specific DNA-binding proteins (See , 

30 generally. Current Protocols in Molecular Biology 998) § 12.2., John 

Wiley & Sons, Inc.). Proteins that bind specifically to an end-labeled DNA 
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fragment retard the mobility of the fragment during electrophoresis, 
resulting in discrete bands corresponding to the individual protein-DNA 
complexes. The assay can be used to test binding of purified proteins or 
of uncharacterized factors found in crude extracts. This assay also 
5 permits quantitative determination of the affinity, abundance, association 
rate constants, dissociation rate constants, and binding specificity of 
DNA-binding proteins. 

b) Basic mobility shift assay procedure 

The basic mobility shift assay procedure includes 4 steps: (1) 
10 preparation of a radioactively labeled DNA probe containing a particular 
protein binding site; (2) preparation of a nondenaturing gel; (3) a binding 
reaction in which a protein mixture is bound to the DNA probe; and (4) 
electrophoresis of protein-DNA complexes through the gel, which is then 
dried and autoradiographed. The mobility of the DMA-bound protein is 
1 5 retarded while that of the non-bound protein is not retarded. 

c) Competition mobility shift assay 

One important aspect of the mobility shift DNA-binding assay is the 
ease of assessing the sequence specificity of protein-DNA interactions 
using a competition binding assay. This is necessary because most 

20 protein preparations will contain specific and nonspecific DNA binding 
proteins. For a specific competitor, the same DNA fragment (unlabeled) 
as the probe can be used. The nonspecific competitor can be essentially 
any fragment with an unrelated sequence, but it is useful to roughly 
match the probe and specific competitor for size and configuration of the 

25 ends. For example, some proteins bind blunt DNA ends nonspecifically. 
These would not be competed by circular plasmid or a fragment with 
overhands, leading to the false conclusion that the protein-DNA complex 
represented specific binding. Perhaps the best control competitor is a 
DNA fragment that is identical to the probe fragment except for a 

30 mutation(s) in the binding site that is known to disrupt function (and 
presumably binding). 
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d) Antibody supershift assay 
Another useful variation of the mobility shift DNA-binding assay is 
to use antibodies to identify proteins present in the protein-DNA complex. 
Addition of a specific antibody to a binding reaction can have one of 
5 several effects. If the protein recognized by the antibody -is not involved 
in complex formation, addition of the antibody should have no effect. If 
the protein that forms the complex is recognized by the antibody, the 
antibody can either block complex formation, or it can form an antibody- 
protein-DNA ternary complex and thereby specifically result in a further 

10 reduction in the mobility of the protein-DNA complex (supershift). 

Results may be different depending upon whether the antibody is added 
before or after the protein binds DNA (particularly if there are epitopes on 
the DNA-binding surface of the protein). 

The mobility shift DNA-binding assay has been successfully 

15 employed (see, e.g., Carthew, et al„ 1985, Cell 43:439-448 (An RNA 
polymerase II transcription factor binds to an upstream element in the 
adenovirus major late promoter); Chodosh, et a!., 1986, MoL Cell. Biol. 
6:4723-4733 (A single polypeptide possesses the binding and activities 
of the adenovirus major late transcription factor); Fried, et al., 1981, 

20 NucL Acids. Res., 9:6505-6525 (Equilibria and kinetics of lac represser- 
operator interactions by polyacrylamide gel electrophoresis); Fried, et aL, 
1984, J. Mol. BioL VJlil^^-lQl (Kinetics and mechanism in the reaction 
of gene regulatory proteins with DNA); Fried, et al., 1984, J. Mol. Biol. 
172 :263-282 (Equilibrium studies of the cyclic AMP receptor protein-DNA 

25 interaction); Garner, et al., 1981, Nucl. Acids Res. 9:3047-3060 (A gel 
electrophoresis method for quantifying the binding of proteins to specific 
DNA regions: Application to components of the Escherichia coll lactose 
operon regulatory system); Hendrickson, et al., 1984, J. Mol. BioL 
174 :61 1-628 (Regulation of the Escherichia coli L-arabinose operon 

30 studied by gel electrophoresis DNA binding assay); Kristie, et al., 1 986, 
Proc. Natl. Acad. Sci. U.S.A. 83:3218-3222 (The major regulatory 
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protein of herpes simplex virus type 1 , is stably and specifically 
associated with promoter-regulatory domains of a genes and/or selected 
viral genes); Lieberman, et al., 1994, Genes & Dev. 8:995-1006 (A 
mechanism for TAFs in transcriptional activation: Activation domain 
5 enhancement of TFIID-TFIIA-promoter DNA complex formation); Riggs, et 
al., 1970, J. MoL BioL 48:67-83 (Lac represser-operator interactions: I. 
Equilibrium studies); Singh, et al,, 1986, Nature 319 :154-158 (A nuclear 
factor that binds to a conserved sequence motif in transcriptional control 
elements of immunoglobulin genes); Staudt, et al., 1986, Nature 

10 323 :640-643 (A lymphoid-specific protein binding to the octamer motif 
of immunoglobulin genes); Strauss, et al., 1984, Cell 37:889-901 (A 
protein binds to a satellite DNA repeat at three specific sites that would 
be brought into mutual proximity by DNA folding in the nucleosome); and 
Zinkel, et al., 1987, Nature 328 :178-181 (DNA bend direction by phase- - 

1 5 sensitive detection)) and the identified protein-DNA interaction pairs can 
be used in the present system. 

e) Midthylation and uracil interference assay 
Interference assays identify specific residues in the DNA binding 
site that, when modified, interfere with binding of the protein (See 

20 generally. Current Protocols in Molecular Biology {^ 998) § 12.3., John 
Wiley & Sons, Inc.). These protocols use end-labeled DNA probes that 
are modified at an average of one site per molecule of probe. These 
probes are incubated with the protein of interests, and protein-DNA 
complexes are separated from free probe by the mobility shift assay. A • 

25 DNA probe that is modified at a position that interferes with binding will 
not be retarded in this assay; thus, the specific protein-DNA complex is 
depleted for DNA that contains modifications on bases important for 
binding. After gel purification the bound and unbound DNA are 
specifically cleaved at the modified residues and the resulting products 

30 analyzed by electrophoresis on polyacrylamide sequencing gels and 

autoradiography. These procedures provide complementary information 
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about the nucleotides involved in protein-DNA interactions, 

1) Methylation interference assays 
In methylation interference, probes are generated by methylating 

guanines {at the N-7 position) and adenines (at the N-3 position) with 
5 DMS; these methylated bases are cleaved specifically by piperidine. 
IVIethylation interference identifies guanines and adenines in the DNA 
binding site that, when methylated, interfere with binding of the protein. 
The protocol uses a single end-labeled DNA probe that is methylated at 
an average of one site per molecule of probe. The labeled probe is a 

10 substrate for a protein-binding reaction. DNA-protein complexes are 

separated from the free probe by the mobility shift DNA-binding assay. A 
DNA probe that is methylated at a position that interferes with binding 
will not be retarded in this assay. Therefore, the specific DNA-protein 
complex is depleted for DNA that contains methyl groups on purines 

15 important for binding. After gel purification, DNA is cleaved with 
piperidine. Finally, these fragments are electrophoresed on 
polyacrylamide sequencing gels and autoradiographed. Guanines and. 
adenines that interfere with binding are revealed by their absence in the 
retarded complex relative to a lane containing piperidine-cleaved free 

20 probe. This procedure offers a rapid and highly analytical means of 
characterizing DNA-protein interactions. 

2) Uracil interference assay 

In uracil interference, probes are generated by PGR arhplification in 
the presence of a mixture of TTP and dUTP, thereby producing products 

25 in which thymine residues are replaced by deoxyuracil residues (which 
contains hydrogen in place of the thymine 5-methyl group). Uracil bases 
are specifically cleaved by uracil-yV-glycosylase to generate apyrimldinic 
sites that are susceptible to piperidine. Uracil interference identifies 
thymines in a DNA binding site that, when modified, interfere with 

30 binding of the protein. Probes generated by PGR amplification in the 
presence of TTP and dUTP incorporate deoxyuracil in place of thymine 
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residues. PCR products are incubated with the binding protein and 
resulting complexes are separated from unbound DNA. The DNA 
recovered from the protein-DNA complex is treated with uracil-N- 
glycosylase and piperidine, and the products are then electrophoresed on 
5 a denaturing polyacrylamide gel. 

The methylation and uracil interference assays have been 
successfully used (see, e.g., Baldwin, et al., 1988, Proc. NatL Acad. Sci. 
U.S.A. 85:723-727 (Two transcription factors, H2TF1 and NF-kB, 
interact with a single regulatory sequence in the class I MHC promoter); 

10 Brunelle, et al., 1987, Proc. NatL Acad. Sci. U.S.A. 84:6673-6676 
(Missing contact probing of DNA-protein interactions); Goeddel, et al., 
1978, Proc. NatL Acad. ScL U.S.A. 75:3579-3582 (How lac repressor 
recognizes lac operator); Ivarie, et al., 1987, NucL Acids Res. 15:9975- 
9983 (Thymine methyls and DNA-protein interactions); Maxam, et al., 

15 1980, Methods Enz/mo/ 65:499-560 (Sequencing end-labeled DNA with 
base-specific chemical cleavages); Pu, et al., 1992, NucL Acids Res. 
20:771-775 {Uracil Interference, a rapid and general method for defining 
protein-DNA interactions involving the 5-methyl group of thymines: The 
GCN4-DNA complex); Siebenlist, et al., 1980, Proc. NatL Acad. ScL 

20 U.S.A. 77:1 22-126 (Contacts between E. coli RNA polymerase and an 
early promoter of phase T7); and Hendrickson, et al., 1985, Proc. NatL 
Acad. ScL U.S.A. 82:3129-3133 (A dimer of AraC protein contacts three 
adjacent major groove regions at the Ara I DNA site)) and the identified 
protein-DNA interaction pairs can be used in the present system. 

25 3) DNase I footprint analysis 

Deoxyribonuclease I (DNase I) protection mapping, or footprinting, 
is a valuable technique for locating the specific binding sites of proteins 
on DNA (See generally, Currerit Protocols in Molecular Biology (1 998) § 
12.4., John Wiley & Sons, Inc.). The basis of this assay is that bound 

30 protein protects that phosphodiester backbone of DNA from DNase I 
catalyzed hydrolysis. Binding sites are visualized by autoradiography of 
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the DNA fragments that result form hydrolysis, following separation by 
electrophoresis on denaturing DNA sequencing gels. Footprinting has 
been developed further as a quantitative technique to determine separate 
binding curves for each individual protein-binding site on the DNA. For 
5 each binding site, the total energy of binding is determined directly from 
that site's binding curve. For sites that interact cooperatively, 
simultaneous numerical analysis of all the binding curves can be used to 
resolve the intrinsic binding and cooperative components of these 
energies. 

10 DNase I footprint analysis has been successfully employed (see, 

e.g., Ackers, et al., 1982, Proc. Nat/. Acad, ScL U.S.A. 79:1129-1133 
(Quantitative model for gene regulation by lambda phage repressor); 
Ackers, et al., 1983, J. MoL Biol. 170:223-242 (Free energy coupling 
within macromolecules: The chemical work of ligand binding at the 

15 individual sites in cooperative systems); Brenowitz, et aL, 1986, Proc. 
Natl. Acad. Sci. U.S.A. 83:8462-8466 (Footprint titrations yield valid 
thermodynamic isotherms.); Brenowitz, et ai., 1986, Meth. Enzymol. 
130:132-181 (Quantitative DNase I footprint titration: A method for 
studying protein-DNA interactions); Dabrowiak, et al., 1989, In Chemistry 

20 and Physics of DNA-Ligand Interactions {N,R. Kallenback, ed.) Adenine 
Press. (Quantitative footprinting analysis of drug-DNA interactions); 
Galas, et al., 1978, Nucl. Acids Pes. 5:3157-3170 (DNase footprinting: A 
simple method for the detection of protein-DNA binding specificity); 
Hertzberg, et al., 1982, J. Am. Chem. Soc. 104:313-315 (Cleavage of 

25 double helical DNA by (methidiumpropyl-EDTA) iron (II)); Johnson, et a!., 
1979, Proc. Natl. Acad. Sci. U.S.A. 76:5061-5065 (Interactions between 
DNA-bound repressors govern regulation by the lambda phage repressor); 
Johnson, et al., 1985, Meth. Enzymol. 117 :301-342 (Nonlinear least- 
squares analysis); Senear, et al., 1986, Biochemistry 25:7344-7354^ 

30 (Energetics of cooperative protein-DNA interactions: Comparison between 
quantitative DNase I footprint titration and filter binding); and Tullius, et 
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al., 1987, Meth. Enzymol. 155 :537-558 {Hydroxyl radical footprinting: A 
high resolution method for mapping protein-DNA contacts), and the 
identified protein-DNA interaction pairs can be used in the present 
system. 

5 4) Screening a >)gt11 expression 

library with recognition-site DNA 

A clone encoding a sequence-specific protein can be detected in a 

>tgt1 1 library because its recombinant protein binds specifically to a 

radiolabeled recognition-site DNA {See generally. Current Protocols in 

10 Molecular Biology (1998) § 12.7., John Wiley & Sons, Inc.). 

Bacteriophage from a cDNA library constructed in the vector ><gtl 1 are 
plated under lytic growth conditions. After plaques appear, expression of 
the )ff-galactosidase fusion proteins encoded by the recombinant phage is 
induced by placing nitracellulose filters impregnated with IPTG onto the 

1 5 plate. Phage growth is continued and is accompanied by the 

immobilization of proteins, from lysed cells, onto the nitrocellulose filters. 
The filters are lifted after this incubation, blocked with protein, then 
reacted with a radiolabeled recognition-site DNA (containing one or more 
binding sites for the relevant sequence-specific protein) in the presence of 

20 an excess of nonspecific competitor DNA. After the binding reaction, the 
filters are washed to remove nonspecifically bound probe and processed 
for autoradiography. Potentially positive clones detected in the primary 
screen are rescreened after a round of plaque purification. Recombinants 
which screen positively after enrichment and whose detection specifically 

25 requires the recognition-site probe (non detected with control probes 
lacking the recognition site for the relevant protein) are then isolated by 
further rounds of plaque purification. 

The /Igtl 1 expression screening methods have been successfully 
used (see, e.g., Androphy, et al,, 1987, Nature (LondJ 325 :70-73 

30 (Bovine papillomavirus E2 trans-activating gene product binds to specific 
. sites in papillomavirus DNA); Arndt, et aL, 1986, Proc. Natl. Acad. ScL 
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U.S.A. 83:8516-8520 (GCN4 protein, a positive transcription factor in 
yeast binds general control promoters at 5TGACTC3' sequences); 
Chodosh, et aL, 1988, Ce// 53:25-35 (A yeastand a human CCAAT- 
binding protein have heterologous subunits that are functionally 
5 interchangeable); Desplan, et al., 1985, Nature (LondJ 318 :630-635 
(The Drosophila developmental gene, engrailed, encodes a sequence- 
specific DNA binding activity); Hoeffler, et al., 1988, Science 242 :1430- 
1433 (Cyclic AMP-responsive DNA-binding protein: Structure based on a 
cloned placental cDNA); Hsiou-Chi, et al., 1988, Science 242 :69-71 

10 (Distinct cloned class II MHC DNA binding proteins recognize the X box 
transcription element); Ingraham, et al., 1988, Cei/ 55:519-529 (A tissue- 
specific transcription factor containing a homeo domain specifies a 
pituitary phenotype); Kadonaga, et al., 1987, Ce// 51:1079-1090 
(Isolation of cDNA encoding transcription factor Spl an functional 

15 analysis of the DNA binding domain); Keegan, et al., 1986, Science 
231:699-704 (Separation of . DNA binding from the transcription- 
ac'tivating function of a eukaryotic regulatory protein); Miyamoto, et al., 
1988, Ce/I 54:903-913 (Regulated expression of a gene encoding a 
nucleic factor, IRF-1, that specifically binds to IFN-y? gene regulatory 

20 elements); Murre, et al., 1989, Ce//'56:777-783 (A new DNA binding and 
dimerization motif in immunoglobulin enhancer binding, daughterless, 
MyoD and myc proteins); Muller, et al., 1988, Nature (Lond.) 336 :544- 
551 (A cloned octamer transcription factor stimulates transcription from 
lymphoid specific promoters in non-B cells); Rawlins, et al., 1985, Cell 

25 42:859-868 (Sequence-specific DNA binding of the Epstein-Barr viral 
nuclear antigen (EBNA-1) to clustered sites in the plasmid maintenance 
region); Reith, et al., 1989, Proc. Natl. Acad. Sci. U.S.A. 86:4200-4204 
(Cloning of the major histocompatibility complex class II promoter 
affected in a hereditary defect in class II gene regulation); Singh, et al., 

30 1988, Ce// 52:415-423 (Molecular cloning of an enhancer binding 

protein: Isolation by screening, of an expression library with a recognition 
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site); Staudt et al., 1988, Science 241 :577-580 (Molecular cloning of a 
lymphoid-specific cDNA encoding a protein that binds to the regulatory 
octanner DNA motif); Sturm, et al., 1988, Genes & Dev. 2:1582-1599 
(The ubiquitous octamer protein Oct-1 contains a Pou domain with a 
5 homeo subdomain); Vinson, et al., 1988, Genes & Dev. 2:801-806 {/n 
situ detection of sequence-specific DNA binding activity specified by a 
recombinant bacteriophage); Weinberger, et al., 1 985, Science 228 :740- 
742 (Identification of human glucocorticoid receptor complementary DNA 
clones by epitope selection); and Young, et al., 1983, Science 222 :778- 
10 782 (Yeast RNA polymerase II genes: Isolation with antibody probes)) and 
the identified protein-DNA interaction pairs can be used in the present 
system. 

5) Rapid separation of protein-bound 
DNA from free DNA 

1 5 This method relies on the ability of nitrocellulose to bind proteins 

but not double-stranded DNA (See generally. Current Protocols in 
Molecular Biology (1998) § 12.8., John Wiley & Sons, Inc.). Use of 
radioactively labeled double-stranded DNA fragments allows quantitation 
of DNA bound to the protein at various times and under various 

20 conditions, permitting kinetic and equilibrium studies of DNA-binding 
interactions. Purified protein is mixed with double-stranded DNA in an 
appropriate buffer to allow interaction. After incubation, the mixture is 
suction filtered through nitrocellulose, allowing unbound DNA to pass 
through the filter while the protein (and any DNA interacting with it) is 

25 retained. 

Nitrocellulose filter methods have been successfully used (see, 
e.g., Berkley, et al., 1975, Biochemistry V^.MOO-MM (Interaction of 
effecting ligands with lac repressor and repressor-operator complex); 
Fried, et al., 1981, Nucl. Acids Res. 9:6505-6525 (Equilibria and kinetics 
30 of lac repressor-operator interactions by polyacrylamide gel 

electrophoresis); Hinkle, et al., 1972, J. Mol. Biol. 70:157-185 (Studies 
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of the binding of Escherichia coli RNA polymerase to DNA I. The role of 
Sigma subunit in site selection); Hinkle, et al., 1972, J. i\/loL Biol. 70:187- 
1 95 (Studies of the binding of Escherichia coli RNA polymerase to DNA 11. 
The kinetics of the binding reaction); Hinkle, et al., 1972, J. Mol. Biol. 
5 70:197-207 (Studies of the binding of Escherichia coli RNA polymerase 
to DNA III. Tight binding of RNA polymerase holoenzyme to single-strand 
breaks in T7 DNA); Jones, et al., 1966, J, Mol. BioL 22:199-209 
(Studies on the binding of RNA polymerase to polynucleotides); Lin, et 
al., 1972, J. Mol. Biol. 72:671-690 (Lac repressor binding to non- 
10 operator DNA: Detailed studies and a comparison of equilibrium and rate 
competition methods); Lin, et al., 1975, Ce// 4:107-1 11 (The general 
affinity of lac repressor for E coli DNA: Implications for gene regulation in 
procaryotes and eucaryotes); Nirenberg, et al., 1964, Science 145 :1399- 
1407 (RNA codewords and protein synthesis: The effect of trinucleotides - 
15 upon the binding of sRNA to ribosomes); Ptashne, et al., 1987, A Genetic 
Switch: Gene Control and Phage A pp. 80-83 and 109-1 18. Cell Press, 
Cambridge, MA and Blackwell Scientific, Boston, MA; Riggs, et al., 1970, 
J. Mol. Biol. 48:67-83 {Lac repressor-operator interactions: I. Equilibrium 
studies); Strauss, et al., 1980, Biochemistry 19:3496-3504 (Binding of' 
20 Escherichia coli ribonucleic acid polymerase holoenzyme to a 

bacteriophage T7 promoter-containing fragment: Selectivity exists over a 
wide range of solution conditions); Strauss, et al., 1980, Biochemistry 
19:3504-3515 (Binding of Escherichia coli ribonucleic acid polymerase 
holoenzyme to a bacteriophage T7 promoter-containing fragment: 
25 Evaluation of promoter binding constants as a function of solution 
conditions); and Strauss, et al., 1981, Gene 13:75-87 (Variables 
affecting the selectivity and efficiency of retention of DNA fragments by 
£ coli RNA polymerase in the nitrocellulose-filter binding assay)) and the 
identified protein-DNA interaction pairs can be used in the present 
30 system. 
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f . Lipid binding moieties 
The conjugate can also contain a lipid binding protein, peptide or 
effective fragment tliereof . . Its specific binding partner can be lipids 
generally, a set of lipids or a particular lipid. Any lipid binding moiety, 
5 particularly proteins, peptides or effective fragments tiiereof can be used 
in the present system. For example, the lipid binding protein can bind to 
a triacylglycerol, a wax, a phosphoglyceride, a sphingolipid, a sterol and a 
sterol fatty acid ester. More preferably, the lipid binding sequence 
comprises a C2 motif or an amphipathic a-helix motif. 

10 Any lipid binding sequence/lipid pair can be designed, screened or 

selected according to the methods known in the art (see, e.g., Kane et 
aL, Anal. Biochem,, 233121:197-204 (1996); Arnold et a!., Biochim. 
Biophys. Acta, 1233121:198-204 (1995); IVIiller and Cistola, Mol. Cell. 
Biochem., 123(l-2) :29-37 (1993); and Teegarden et al., Anal. Biochem., • 

15 199121:293-9 (1991). 

For example, Kane etaL, Anal. Biochem., 233121:197-204 (1996) 
describes that the fluorescent probe 1 -anilinonapthalene 8-sulfonic acid 
(1 ,8-ANS) has been used to characterize a general assay for members of 
the intracellular lipid-binding protein .(iLBP) multigene family. The 

20 adipocyte lipid-binding protein (ALBP), the Iceratinocyte lipid-binding 

protein (KLBP), the cellular retinol-binding protein (CRBP), and the cellular 
retinoic acid-binding protein i (CRABPI) have been characterized as to 
their ligand binding activities using 1,8-ANS. ALBP and KLBP exhibited 
the highest affinity probe binding with apparent dissociation constants 

25 (Kd) of 410 and 530 nM, respectively, while CRBP and CRABPI bound 
1,8-ANS with apparent dissociation constants of 7.7 and 25 microM, 
respectively. In order to quantitate the fatty acid and retinoid binding 
specificity and affinity of ALBP, KLBP, and CRBP, a competition assay 
was developed to monitor the ability of various lipid molecules to displace 

30 bound 1 ,8-ANS from the binding cavity. Oleic acid and arachidonic acid 
displaced bound 1,8-ANS from ALBP, with apparent inhibitor constants 
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(Ki) of 134 nM, while all-trans-retinoic acid exiiibited a seven-fold lower 
Ki {870 nM). The short chain fatty acid octanoic acid and all-trans-retinol 
did not displace the fluorophore from ALBP to any measurable extent. In 
comparison, the displacement assay revealed that KLBP bound oleic acid 
5 and arachidonic acid with high affinity (Ki = 420 and 400 nM, 

respectively) but bound all-trans-retinoic acid with a markedly reduced 
affinity (Ki = 3.6 microM). Like that for ALBP, neither octanoic acid nor 
all-trans-retinol were bound by KLBP. Displacement of 1,8-ANS from 
CRBP by all-trans-retinal and all-trans-retinoic acid yielded Ki values of 1 .7 
10 and 5.3 microM, respectively. These results indicate the utility of the 
assay for characterizing the iigand binding characteristics of members of 
the iLBP family and suggests that this technique may be used to 
characterize the Iigand binding properties of other hydrophobic Iigand 
binding proteins. 

1 5 Arnold et al., Biochim. Biophys. Acta, 1233(2) : 1 98-204 (1 995) 

describes an assay for analyzing the specific binding of proteins to lipid 
ligands contained within vesicles or micelles. This assay, referred to as 
the electrophoretic migration shift assay, was developed using a model 
system composed of cholera toxin and of its physiological receptor, 

20 monosialoganglioside GM1 . Using polyacrylamide gel electrophoresis in 
non-denaturing conditions, the migration of toxin components known to 
interact with G Ml was retarded when GM1 was present in either lipid 
vesicles or micelles. This effect was specific, as the migration of proteins 
not interacting with GM1 was not modified. The localization of retarded 

25 proteins and of lipids on gels was further determined by autoradiography. 
The stoichiometry of binding between cholera toxin and GM1 was 
determined, giving a value of five GM1 per one pentameric assembly of 
cholera toxin B-subunits, in agreement with previous studies. The general 
applicability of this assay was further established using streptavidin and 

30 annexin V together with specific lipid ligands. This assay is fast, simple, 
quantitative, and requires only microgram quantities of protein. 
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Miller and Cistola, Mol. Cell. Biochem., 123(1'2) :29-37 (1993) 
teaches that titration calorimetry can be used as a method for obtaining 
binding constants and thermodynamic parameters for the cytosolic fatty 
acid- and lipid-binding proteins. A feature of this method is its ability to 
5 accurately determine binding constants in a non-perturbing manner. This 
is achieved because the assay does not require separation of bound and 
free ligand to obtain binding parameters. Also, the structure of the 
lipid-protein complex was not perturbed, since native ligands were used 
rather than non-native analogues. As illustrated for liver fatty 

10 acid-binding protein, the. method distinguished affinity classes whose 
dissociation constants differed by an order of magnitude or less. It also 
distinguished endothermic from exothermic binding reactions, as 
illustrated for the binding of two closely related bile salts to ileal 
lipid-binding protein. The main limitations of the method are its relatively 

1 5 low sensitivity and the difficulty working with highly insoluble ligands, 
such as cholesterol or saturated long-chain fatty acids. The 
signal-to-noise ratio was improved by manipulating the buffer conditions, 
as illustrated for oleate binding to rat intestinal fatty acid binding protein. 
Teegarden et al., AnaL Biochem., 199(2) :293-9 (1991) describes 

20 an assay for measurement of the affinity of serum vitamin D binding 

protein for 25-hydroxyvitamin D3, 1 ,25-dihydroxyvitamin D3, and vitamin 
D3, using uniform diameter (6.4 microns) polystyrene beads coated with 
phosphatidylcholine and vitamin D metabolites as the vitamin D donor. 
The lipid metabolite coated beads have a solid core, and thus all of the 

25 vitamin D metabolites are on the bead surface from which transfer to 
protein occurs. After incubating these beads in neutral buffer for 3 h, 
essentially no ^H-labeled vitamin D metabolites desorb from this surface. 
Phosphatidylcholine/vitamin D metabolite-coated beads (1 microM vitamin 
D metabolite) were incubated with varying concentrations of serum 

30 vitamin D binding protein under conditions in which the bead surfaces 
were saturated with protein, but most of the protein was free in solution. 
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After incubation, beads were rapidly centrifuged without disturbing the 
equilibrium of binding and vitamin D metabolite bound to sDBP in solution 
was assayed in the supernatant. All three vitamin D metabolites became 
bound to serum vitamin D binding protein, and after 10 min of incubation 
5 the transfer of the metabolites to serum vitamin D binding protein was 
time independent. The transfer followed a Langmuir isotherm, and the Kd 
for each metabolite binding to serum vitamin D binding protein was 
derived by nonlinear least-squares fit analysis. From this analysis the 
following values for the Kd were obtained: 5.59 x 10"® 

10 25-hydroxyvitamin D; 9.45 x ^0'^ M, 1 ;25-dihydroxyvitamin D; and 9.17 
X 1 6"^ M, vitamin D. The method disclosed herein avoids problems 
encountered in previous assays and allows the precise and convenient 
determination of binding affinities, of vitamin D metabolites and serum 
vitamin D binding protein. 

1 5 In addition, known protein/lipid binding pairs can be used in the 

methods and with the products provided herein (see, e.g., Hinderliter et 
al., Biochim. Biophys. Acta, 1448(2) :227-35 (1998) (C2 motif binds 
phospholipid in a manner that is modulated by Ca2+ and confers 
membrane-binding ability on a wide variety of proteins, primarily proteins 

20 involved in signal transduction and membrane trafficking events); 

Campagna et al., J. Diary Sci., 81 (1 2) :31 39-48 (1998) (an amphipathic 
helical lipid-binding motif of a glycosylated phosphoprotein, component 
PP3 in bovine milk); Chae et al., J. Biol. Chem., 273(40) :25659-63 
(1998) (The C2A domain of synaptotagmin I, which binds Ca2+ and 

25 anionic phospholipids); Johnson et a!., Biochemistry, 37(26) :9509-19 
(1998) (the membrane binding domain of phosphocholine 
cytidylyltransferase (CT) includes a continuous amphipathic alpha-helix 
between residues approximately 240-295 anionic lipids); Kiyosue et aL, 
Plant Mol. fi/oA, 35161:969-72 (1997) {Ca2 + -dependent lipid-binding 

30 domains of cytosolic phospholipase A2, protein kinase C, Rabphilin-3A, 
and Synaptotagmin 1 of animals); Welters et al., Proc. Natl. Acad. Scl. 
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(ySi4, 911241:11398-402 (1994) (calcium-dependent lipid-binding domain 
Is near the N terminus of phosphatidylinositol (PI) 3-kinase cloned from 
Arabidopsis thaliana); and Filoteo et al., J. BioL Chem., 267(1 7) :1 1800-5 
(1992) (Peptide G25: 
5 LysLysAlaValLysValProLysLysGluLysSerValLeuGlnGlyLysLeuThrArgLeuAlaValGlnlle 
(SEQ ID No. 23) representing the putative lipid-binding region (G region) 
of the erythrocyte Ca2+ pump interacted with acidic lipids, as shown by 
the increase in size of phosphatidylserine liposomes in its presence)), 
g. Polysaccharide binding moieties 

10 The conjugate can include a polysaccharide binding protein, peptide 

or effective fragment thereof. Its specific binding partner can be 
polysaccharides generally, a set of polysaccharides or a particular 
polysaccharide. Any polysaccharide binding moiety, such as a protein, 
can be used in the present system and include but are not limited to a 

15 polysaccharide binding sequence that binds to starch, glycogen, cellulose 
or hyaluronic acid. 

Any polysaccharide binding protein/poiysaccharide pair can be 
designed, screened or selected according to the methods known in the 
art including the methods disclosed in Kuo et al., J. Immunol. Methods, 

20 43(1) :35-47 (1981); and Brandt et al., J. Immunol., 108(4) ;913-20 
(1972) (a radioactive antigen-binding assay for Neisseria meningitidis 
polysaccharide antibody). Kuo et al., J, Immunol. Methods, 43(11:35-47 
(1981) provides a polyethylene glycol (PEG) radioimmunoprecipitation 
assay for the detection of antibody to Haemophilus influenza b capsular 

25 polysaccharide, polyribosylribitol phosphate (PRP). The radioactive 

antigen, [^H]PRP, with a high specific activity, was produced by growing 
the organism in the presence of [^HJribose and was purified by 
hydroxylapatite and Sepharose" 4B column chromatography. In the 
assay, PEG (12.5%) was used to separate antibody-bound [^H]PRP from 

30 free I^HIPRP. The assay covered the range of 0-5 and 20 ng 

antibody/assay at a maximum sensitivity of 0.5 approximately 1.0 ng 
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antibody/assay. With various dilutions (1-20 ng antibody/assay) of S. 
Klein reference antiserum, the within-run coefficient of variation (CV) of 
10 replicates ranged from 3.5 to 8.5%, Average CVs of 8,9% and 
1 1 .0% were obtained in the between-run and day-to-day reproducibility 
5 studies. The binding of l^HJPRP to S. Klein reference antiserum was 
severely inhibited by a minute amount of non-radioactive PRP; however, 
no significant interference was found in the presence of high 
concentrations of polysaccharides from Escherichia coli K1 00 and 
Streptococcus pneumoniae indicating that the RIA was highly specific for 

10 antibody to H. influenza b PRP. 

In addition, known protein/polysaccharide binding pairs can be used 
in the methods and with the products provid.ed herein (see, e.g., 
Yamaguchi, et al.. Oral MicrobioL /mmuno/., 13(6) :348-54 (1998) 
(capsule-like serotype-specific polysaccharide antigen lipopolysaccharide 

1 5 from Actinobacillus actinomycetemcomitans/human complement-derived 
opsonins); Lucas, et al., J. Immunol., 161(7) :3776-80 (1998) (kappa II- 
A2 light chain CDR-3 junctional residues in human antibody/Haemophilus 
influenza type b polysaccharide); Miller, et al., Carbohydr. Res., 
309(3) :219-26 (1998) (fragments of the Shigella dysenteriae type 1 O- 

20 specific polysaccharide/monoclonalIgM 3707 E9); Prehm, et al., Protein 
Expr. Purit, 7(4) :343-6 (1996) (digitonin/hyaluronate synthase); Jiang, et 
al,, Infect. Immun., 63(7) :2537-40 (1995) (mannose-binding 
protein/Klebsieila 03 lipopolysaccharide); Pelkonen, et al., J. BacterioL, 
174(231 :7757-61 (1992) (bacteriophage depolymerase/bacterial 

25 polysaccharide); Morishita, et al,, Biochem. Biophys. Res. Commun., 
176(3) :949-57 (1991) (Microbial polysaccharide, HS-142-1/guanylyl 
cyclase-contalning receptor); Ohtomo, et al., Can. J. Microbiol., 
36(3) :206-10 (1990) (staphylococcal cell surface polysaccharlde/human 
fibrinogen); Yamagishi, et al., FEBSLett., 225(1-2) :109-12 (1987) 

30 (heparin or dermatan sulfate/thrombin); DeAngelis, et a!., J. Biol. Chem., 
262(291 :13946-52 (1987) (sulfated fucans/bindin, the adhesive protein 
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from sea urchin sperm); Volanakis, et al., MoL Immunol., 20(111 :1 201-7 
(1983) (human C4/C-reactive protein-pneumococcal C-polysaccharide 
complexes); Naruse, et al., J. Biochem. (Tokyo), 90(3) :581-7 (1981) (a 
polysaccharide from the cortex of sea urchin egg/microtubule-associated 
5 proteins)/ Levy, et al., J, Exp. Med., 1 53(4) ;883-96 (1981) (agaropectin 
and heparin/human IgG proteins); Hu, et al., Biochemistry, 14(10) :2224- 
30 (1975) (glycogen phosphorylase A/a series of semisynthetic, branched 
saccharides); Fagerstrom, M/croA/o/og^/, 140191:^ (1994) (raw- 

starch-binding consensus amino acids in the C-terminai part of 

10 glucoamylase P); Murata, et aL, J. Vet. Med. ScL, 57(3) :419-25 (1995) 
(C-polysaccharide/C-reactive protein (CRP)); Reason, et al,. Infect. 
Immun., 67(2) :994-7 (1999) (Antibodies having light (L) chains encoded 
by the kappall-A2 variable region/Haemophilus influenza type b 
polysaccharide (Hib PS)). 

15 h. Metal binding moieties 

The conjugate can contain a metal binding moiety, such as a metal 
binding protein, peptide or effective fragment thereof. The specific 
binding partner can be metal ions generally, a set of metal ions or a 
particular metal ion. Any metal binding moiety is contemplated. For 

20 example, the metal binding sequence can bind to a sodium, a potassium, 
a magnesium, a calcium, a chlorine, an iron, a copper, a zinc, a 
manganese, a cobalt, an iodine, a molybdenum, a vanadium, a nickel, a 
chromium, a fluorine, a silicon, a tin, a boron or an arsenic ion. 

Any metal binding moiety/metal ion pair can be designed, screened 

25 or selected according to the methods known in the art including the 

methods disclosed in U.S. Patent No. 5,679,548; Kang et al.. Virus Res., 
49(2) : 147-54 (1997); Dealwis et aL, Biochemistry, 34(43) : 13967-73 
(1995)*; and Hutchens et al., J. Chromatogr., 604(1): 125-32 (1992). 
U.S. Patent No. 5,679,548 discloses a method for producing a 

30 metal binding site in a polypeptide capable of binding a preselected metal 
ion-containing molecule, the step of inducing mutagenesis of a 
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complementarity determining region (CDR) of an immunoglobulin heavy or 
light chain gene, where mutagenesis introduces a metal binding site, by 
amplifying the CDR of the gene by a primer extension reaction using a 
primer oligonucleotide, the oligonucleotide comprising: a) a 3' terminus 
5 and a 5' terminus comprising; b) a nucleotide sequence at the 3' terminus 
complementary to a first framework region of the heavy or light chain 
immunoglobulin gene; c) a nucleotide sequence at the 5' terminus 
complementary to a second framework region of the heavy or light chain 
immunoglobulin gene; and d) a nucleotide sequence between the 3' 

10 terminus and 5' terminus according to the formula; [NNSlg, wherein N is 
independently any nucleotide, S is G or C, and a is from 3 to about 50, 
and the 3' and 5' terminal nucleotide sequences having a length of about 
6 to 50 nucleotides, and sequences complementary thereto. 

U.S. Patent No. 5,679,548 also describes a method for producing " 

15 a metal binding site in a polypeptide capable of binding a preselected 
metal ion-containing molecule, the step of inducing mutagenesis of a 
complementarity determining region (CDR) of an immunoglobulin heavy or 
light chain gene by amplifying the CDR of the gene by a primer extension 
reaction using a primer oligonucleotide, the oligonucleotide comprising: a) 

20 a 3' terminus and a 5' terminus; b) a nucleotide sequence at the 3' 

terminus complementary to a first framework region of the heavy or light 
chain immunoglobulin gene; c) a nucleotide sequence at the 5' terminus 
complementary to a second framework region of the heavy or light chain 
immunoglobulin gene;, and d) a nucleotide sequence between 3' terminus 

25 and 5' terminus according to the formula: -X-INNK]a-X-[NNK]-X, wherein 
N is independently any nucleotide, K is G or T, X Is a trinucleotide 
encoding a native aniino acid residue coded by the immunoglobulin gene 
and a is from 3 to about 50, and the 3' and 5' terminal nucleotide 
sequences having a length of about 6 to 50 nucleotides, and sequences 

30 complementary thereto. Preferably, the immunoglobulin to be 
mutagenized is a human immunoglobulin, the CDR is CDR3, the 
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mutagenizing oligonucleotide has the formula: 

5'-GTGTATTATTGTGCGAGA[NNS]3TGGGGCCAAGGGACCACG-3' (SEQ 
ID No. 24), and the preselected metal ion-containing molecule is 
magnetite, copper(ll), zinc{ll), lead(ll), cerium(lll), or iron(lll). 
5 Kang et al., Virus Res., 49(2) : 147-54 (1997) isolated human 

papillomavirus (HPV) type 18 E7 gene by polymerase chain reaction (PGR) 
amplification from tissues of Korean cervical cancer patients and cloned 
into a plasmid vector, pET-3a, for the expression of recombinant E7 
protein (rE7) in Escherichia coli. The rE7 protein was purified to the 

10 homogeneity and its purity was confirmed by HPLC. The purified protein 
was analyzed for the metal-binding properties by UV spectroscopy and it 
was shown that two Cd^* or Zn^"^ ions bind to one E7 protein by the 
metal-sulfur ligand formation via two Cys-X-X-Cys motifs In E7 protein. 
When the change of intrinsic fluorescence of tryptophan residue was 

15 analyzed for rE7-Zn complex, the blue shift of emission wavelength and 
the decrease in maximum intensity of emission were observed compared 
with rE7. These results suggest that Zn^'^-bound rE7 has undergone 
conformational change, in which a tryptophan residue located in the 
second Cys-X-X-Cys motif was moved into solvent-inaccessible or 

20 hydrophobic environment. 

Dealwis et al., Biochemistry, 34(43) :1 3967-73 (1995) present the 
refined crystal structures of three different conformational states of the 
Asp153">Gly mutant (D153G) of alkaline phosphatase (AP), a 
metalloenzyme from Escherichia coli. The apo state is induced in the 

25 crystal over a 3 month period by metal depletion of the holoenzyme 
crystals. Subsequently, the metals are reintroduced in the crystalline 
state in a time-dependent reversible manner without physically damaging 
the crystals. Two structural intermediates of the holo form based on data 
from a 2 week (intermediate I) and a 2 month soak (intermediate II) of the 

30 apo crystals with Mg^"*^ and Zn^* have been identified. The 
three-dimensional crystal structures of the apo (R = 18.1%), 
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intermediate I (R = 19.5%), and intermediate II (R = 19.9%) of the 
D153G enzyme have been refined and the corresponding structures 
analyzed and compared. Large conformational changes that extend from 
the mutant active site to surface loops, located 20 A away, are observed 
5 in the apo structure with respect to the holo structure. The structure of 
intermediate I shows the recovery of the entire enzyme to an almost 
native-like conformation, with the exception of residues Asp 51 and Asp 
369 in the active site and the surface loop (406-410) which remains 
partially disordered. In the three-dimensional structure of intermediate II, 

10 Asp 51 and Asp 369 are essentially in a native-like conformation, but the 
main chain of residues 406-408 within the loop is still not fully ordered. 
The D153G mutant protein exhibits weak, reversible, time dependent 
metal binding in solution and in. the crystalline state. 

Hutchens et al., J. Chromatogr. , 604(11 :125-32 (1992) prepared 

1 5 synthetic peptides representing metal-binding protein surface domains 
from the human plasma metal transport protein known as histidine-rich 
glycoprotein (HRG) to evaluate biologically relevant peptide-metal ion 
interactions. Three synthetic peptides, representing multiples of a 
5-residue repeat sequence (Gly-His-His-Pro-His) (SEQ ID No. 25) from 

20 within the histidine- and proline-rich region of the C-terminal domain were 
prepared. Prior to immobilization, the synthetic peptides were evaluated 
for identity and sample homogeneity by matrix-assisted UV laser 
desorption time-of-f light mass spectrometry (LDTOF-MS). Peptides with 
bound sodium and potassium ions were observed; however, these signal 

25 intensities were reduced by immersion of the sample probe tip in water. 
Mixtures of the three different synthetic peptides were also evaluated by 
LDTOF-MS after their elution through a special immobilized peptide-metal 
ion column designed to Investigate metal ion transfer. It was found that 
LDTOF-MS to be a useful new method to verify the presence of 

30 peptide-bound metal ions. 

In addition, the protein/metal binding pairs, which are known (see, 

-213- 



wo 01/62968 



PCT/USOl/00452 



e.g., DiDonato, et al.. Adv. Exp. Med. BioL, 448:165-73 (1999) 
(copper/copper binding domain from the Wilson disease copper 
transporting ATPase (ATP7B)); Buchko, et al., Biochem Biophis. Res. 
Commun., 254(1) :109-13 (1999) (Zn^+/Xenopus laevis nucleotide 
5 excision repair protein XPA); Lai, etal., Biochemistry, 37(48):7005-1 5 
(1998) (Zn^"^/hdm2 RING finder domain); Mitterauer, et al.. Biochemistry, 
371461:16183-91 (1998) (The C2 catalytic domain of adenylyl cyclase 
contains the second metal ion {Mn2 + ) binding site); Hess, etai., Protein 
Sci., 7(9) :1 970-5 (1998) (Zn^*^/Human nucleotide excision repair protein 

10 XPA); Goedken, etai, Proteins, 33(1) :135-43 (1998) (Mg^^ and 

Mn^'*"/ribonuclease H domain of Moloney murine leukemia virus reverse 
transcriptase); Chang, etai, Protein Eng., 11111:41-6 (1998) (beta- 
domain of metallothionein); Champeil, etal., J. BioL Chem., 
273(1 2) :661 9-31 (1998) (cytosolic portion of sarcoplasmic reticulum 

15 Ca2-i- -ATPase); Bavoso, etal., Biochem. Biophys. Res. Commun., 

242(21:385-9 (1998) (zinc finger peptide containing the Cys-X2-Cys-X4- 
His-X4-Cys domain encoded by the Drosophila Fw-element); Gitschier, et 
al., Nat Struct Biol., 5(1) :47-54 (1998) (metal-binding domain from the 
Menkes copper-transporting ATPase); Gadhavi, FEBS Lett, 417(1) :145-9 

20 (1997) (Zn^"^/ion binding site in the DNA binding domain of the yeast 
transcriptional activator GAL4); Roehm, etal., Biochemistry, 
36(331:10240-5 (1997) (Zn^VRING finger domain of BRCA1); Dalton, et 
al.,MoL Cell Biol., 17(5) :2781-9 (1997) (metal response element-binding 
transcription factor 1 DNA binding involves zinc interaction with the zinc 

25 finger domain); Essen, et al., Biochemistry, 36(10 :2753-62 (1997) 

(Ca^*/A ternary metal binding site in the C2 domain of phosphoinosltlde- 
specific phospholipase C-deltal); Curtis, etaL, EMBOJ., 16(4) :834:43 
(1997) (Zn^-*^/CCHC metal-binding domain in Nanos); Worthington, etal., 
Proc. Natl. Acad. Sci. USA, 93(241:13754-9 (1996) (zinc-binding domain 

30 of Nup475); Mahadevan, etal., Biochemistry, 34(21:2095-106 (1995) 
(Ba2+, Ca^\ Mg2+, Mn^^. Ni^*, Zn^+ZA divalent metal ion binding site in 
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the kinase insert domain of the alpha-platelet-derived growth factor 
receptor); Pan, eta/., Biochem. Biophys. Res. Commun., 202(1) :621-8 
(1994) (alpha and beta domains of mammalian metallothionein); Borden, 
etaL, FEBSLett., 335121:255-60 (1993) (Cu2^ Zn^+Zcysteine/histidine- 
5 rich metal binding domain from Xenopus nuclear factor XNF7); Chauhan, 
etaL.J. 5acfer/bA, 125(221:7222-7 (1993) (Mg^^/Bradyrhizobium 
japonicum delta-aminolevulinic acid dehydratase is metal-binding domain); 
Knegtel, etaL, Biochem. Biophys. Res, Commun,, 192(2) :492-8 (1993) 
(Zn^'^/metal coordination in the human retinoic acid receptor-beta DNA 

10 binding domain); Spencer, etaL, Biochem. J., 290(1) :279-87 (1993) 
(Co^"*", Mg^^, Zn^'^/5-aminolaevulinic acid dehydratase from Escherichia 
coli reactive thiols at the metal-binding domain); Mau, et aL, Protein Sci., 
1(11) : 1403-1 2 (1992) (Zn2+/GAL4 DNA-binding domain); Vaughan, et 
at.r Virology, 189(1) :377-84 (1992) (Zn^+ZThe herpes simplex virus 

15 immediate early protein ICP27 metal binding domain); Boese, etaL, J. 
Biol. C/7em., 266(261:17060-6 (1991) (Mg^+ZAminolevulinic acid 
dehydratase in pea metal-binding domain); Hutchens, etaL, J. Biol. 
Chem., 264(29) : 17206-1 2 (1989) (Cu^.^, Zn^VDNA-binding 
estrogen receptor); Stillman, etaL, Biochem. J., 262(11 :181-8 (1989) 

20 (Cd^^ and Zn^"^ /rabbit liver metallothionein 2); Freedman, et aL, Nature, 
334(61 82) :543-6 (1988) (Cd^"** and Zn^"^ /metal coordination sites within 
the glucocorticoid receptor DNA binding domain); Stiilman, etaL, J. BioL 
Chem., 2631131:6128-33 (1988) (Cd^^ and Zn^"" /metallothionein); and 
Corson, etaL, Biochemistry, 25(7) :1817-26 (1986) (Ca^'^Zcalcium-binding 

25 proteins C-terminal alpha-helix of a helix-loop-helix metal-binding domain)) 
can be used in the present system. 

Among the preferred pairs, are the following metal binding 
sequenceZmetal ion pairs (see, U.S. Patent No. 5,679,548) set forth in 
the following table. 
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Table 6. Examples of Metal Ion Binding Sequence/Metal Ion Pairs 





ivieTai ion 


metal ion binding sequence 


ScU ID NO. 




ivigui/ 


SerArgArgSerArgHlsHisProArgMetTrpAsnGlyLeuAspVal 


26 






vjiyArgrnBLysArgvaiArgAspArg i rpvai vaiilerneAsprne 


27 






oiyvaiAiaArgberLysLyslvletArgoiyLeuTrpArgLeuAspVal 


. 28 






oiyLeuAiavaiArgoerLysArgGiyArgPnePheLeuPheAspVdl 


29 




Cu(lU 


ulyArgValnisHisHisSerLeuAspVai 


30 






SerTrpLysHisHisAlaHisTrpAspVal 


31 






GlySerTrpAspHisArgGlyCysAspGIy 


32 


1 u 




GlyHisHislvletTyrGlyGlyTrpAspHis 


33 






GlyHisTrpGlyArgHisSerLeuAspThr 


34 






GlyHisMeLeuHisHisGinLeuAspLeu 


35 






SerSerGlnArgLeuMetLeuGlyAspAsn 


36 






SerHisHisGlyHisHisTyrLeuAsnHis 


37 


1 ^ 
1 o 




GlyLysLeuMetMetSerTrpCysArgAspThrGluGlyCysAspHis 


38 






GlyAspThrHisArgGlyHisLeuArgHisHisLeuProHisAspTrp 


39 






G!yTrpGlyLeuTrpMetLysProPheValTrpArgA!aTrpAspMet 


40 




Znill) 


GlyArgVaiHisHisHisSerLeuAspVal 


41 






SerHisThrHisAlaLeuProLeuAspPhe 


42 


on 




•Aa^.T'L.mAa^i.Aa^ 

GlyGlnSerberGlyGlyAspTnrAspAsp 


43 






GlyGlnTrpTnrProArgGlyAspAspPne 


44 






GlyArgCysCysProSerSerCysAspGlu 


45 






GlyProAIaLysHisArgHisArgHisValGlyGlnMetHisAspSer 


46 




Pb(ni) 


GlyAsnLeuArgArgLysThrSerAsplle 


47 






GlyGluSerAspSerLysArgGluAspGly 


48 






GlyGlyProSerLeuAlaVaiGlyAspTrp 


49 






GlyProLeuGInHisThrTyrProAspTyr 


50 






GlyTrpLysValThrAlaGluAspSerThrGluGlyLeuPheAspLeu 


51 






GlyTnrArgValTrpArgVaiCysGinTrpAsnHisGIuGluAspGly 


52 






GlyGluTrpTrpCysSerPheAlalvletCysProAlaArgTrpAspPhe 


53 






GlyAspTnrllePheGlyValThrMetGlyTyrTyrAlaMetAspVal 


54 




Ce(ili) 


GlyGinValMetGInGluLeuGlyAspAla 


55 






GlyLeuThrGIuGlnGlnLeuGlnAspGly 


56 






GlyTyrSerTyrSerValSerProAspAla 


57 


35 




GlyArgLeuGlyLeuValMetThrAspGlu 


58 






SerThrTrpProGlyArgGlnArgLeuGtyGlnAlaLeuSerAspSer 


59 






GlyTyrGIuLeuSerTrpGlyValAspGlnGlnGIuTrpTrpAsplle 


60 






GlyProValArgGlyLeuAspGlnSerLysGlyValArgTyrAspAsn 


61 






GlyLeuSerGlnHislleValSerGluThrGlnSerSerGlyAspLeu 


62 


40 




GlyLeuGluSerLeuLysValLeuGlyVaiGlnLeuGlyGiyAspLeu 


63 






GlyAsnMetlleLeuGlyGlyProGlyCysTrpSerSerAlaAsplle 


64 
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Metal Ion Metal Ion Binding Sequence SEQ ID NO. 

GlyCysTrpAsnValGlnArgLeuValValTyrHisProProAspGly 65 

GlyPheGluValThrCysSerTrpPheGlyHisTrpGlyArgAspSer 66 

Fe(lll) SerAlaSerMetArgSerAlalleGlyLeuTrpArgThrMetAspTyr 67 

GlyAspArgGlullePheHisMetGlnTrpProLeuArgValAspVal 68 

5 SerGlnAsnProGlnGlnValCysGiyValArgCysGlyGlnAspLys 69 

GfyAsnArgteuSerSerGlyHisLeuLsuLysGlnGlyGlnAspGly 70 

GlyGlySerAspTrpGlnlleGlyAlaCysCysArgGIuAspAspLeu 71 

GlyMetValSerMetMetGlyGlnSerArgProThrGInCysAspCys 72 

GlyVallleLysTrpileArgArgTrpValArgThrAlaArgAspVal 73 

1 0 GlyTrpPheTrpArgLeuLeuProThrProArgAlaProSerAspVal 74 



I. Other facilitating agents 
Facilitating agents can be derived from an enzyme, a transport 
protein, a nutrient or storage protein, a contractile or motile protein, a 
1 5 structural protein, a defense protein, a regulatory protein, or a fluorescent 
protein. Exemplary of such other fragments are those derived from an 
enzyme such as a peroxidase, a urease, an alkaline phosphatase, a 
luciferase and a glutathione S-transf erase. 

1) Peroxidase 

20 Any peroxidase can be used in the present system. More 

preferably, a horseradish peroxidase is used. For example, the 
horseradish peroxidases with the following GenBank accession Nos. can 
be used: E01651; D90116 (prxC3 gene);D90115 (prxC2 gene); J05552 
(Synthetic isoenzyme C{HRP-C)); SI 4268 (neutral); OPRHC (CI 

25 precursor); S00627 (CIC precursor); JH0150 (C3 precursor); S00626 
(C1B precursor); JH0149 (C2 precursor); CAA00083 (Armoracia 
rusticana); and AAA72223 (synthetic horseradish perioxidase isoenzyme 
C (HRP-O). 

2) urease 

30 Any urease can be used in the present system. For example, the 

ureases with the following GenBank accession Nos. can be used: 
AF085729 (Ureaplasma urealyticum serovar); AF056321 (Actinomyces 
naeslundii); AF095636 (Yersinia pestis); AF006062 (Filobasidiella 
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neoformans var. neoformans (UREI)); U81509 (Coccidioides immitis 
urease); AF000579 (Bordetella bronchiseptica); U352248 (Streptococcus 
salivarius); U3301 1 (Mycobacterium tuberculosis); U89957 
(Actinobacillus pleuropneumoniae urease operon (ureABCXEFGD); 
5 D14439 (Thermophilic Bacillus); L40490 (Ureapiasma urealyticum T960 
urease); L40489 (Ureapiasma urealyticum strain 7); U40842 (Yersinia 
pseudotuberculosis); M65260 (Canavalia ensiformis); U29368 (Bacillus 
pasteurii urease operon); 125079 (Heliobacter heilmannii urease); L24101 
(Yersinia enterocolitica); M31834 (P.mirabilis urease operon); M36068 

10 (K.aerogenes); L07039 (Klebsiella pneumoniae); M60398 (H. pylori); 

L03308 (E.coli urease gene cluster); L03307 (E.coli urease gene cluster). 
3) Alkaline phosphatase 
Any alkaline phosphatase can be used in the present system. For 
example, the alkaline phosphatases encoded by nucleic acids with the 

15 following GenBank accession Nos. can be used: AB013386 (Bombyx 
mori s-Alp soluble alkaline phosphatase); AF1541 10 (Enterococcus 
faecalis (phoZ); Ml 3077 (Human placental); AF052227 (Bos taurus 
intestinal); AF052226 (Bos taurus intestinal); AF079878 (Thermus sp. 
(TAP)); AF047381 (Pseudomonas aeruginosa (phoA)); U49060 (Bacillus 

20 subtilis (phoD)); J03930 (Human intestinal (ALPI)); J03252 (Human 
alkaline (ALPP)); U19108 (Gallus tissue-nonspecific); M13345 (E. coli); 
U31569 (Felis catus (alpl)); L36230 (Zymomonas mobilis (phoD)); 
Ml 91 59 (Human placental heat-stable (PLAP-1)); Ml 2551 (Human 
placental (FLAP)); M31008 (Human intestinal); J04948 (Human (ALP-1); 

25 J03572 (Rat); M61705 (Mouse intestinal (lAP); M61704 (Mouse 

embryonic); M61706 (Mouse (AP) pseudogene); M21 134 (S.cerevisiae 
(rALPase)); L07733 (Cow intestinal (lAP)); Ml 8443 (Bovine); M77507 
(Synechococcus sp. atypical); M33965 (S.marcescens (phoA)); M33966 
(E.fergusonii (phoA)); M29670 (E.coli (phoA)); M29669 (E.coli (phoA)); 

30 M29668 (E.coli (phoA)); M29667 (E.coli (phoA)); M29666 (E.coli 
(phoA)); M29665 (E.coli (phoA)); M29664 (E.coli (phoA)); M29663 
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Enhancer firefly lucif erase (luc + ) gene); U47296 (Cloning vector pGL3- 
Control firefly luciferase (luc + ) gene); U47295 (Cloning vector pGL3- 
Basic firefly luciferase {luc + ) gene); U47123 (Cloning vector pSP- 
luc + NF, luciferase cassette fusion vector); U47122 (Cloning vector pSP- 
5 luc + , Luciferase cassette vector); M10961 (V.harveyi (luxA and luxB); 
M65067 (Photobacterium phosphoreunn (luxA and luxB); M62917 
(Xenorhabdus luminescens (luxA, luxB, luxC, and luxD); M25666 
(V.hilgendorfii); M63501 (Renilla reniformis); Ml 5077 (P.pyralis (firefly)); 
M26194 (Luciola cruciata); M55977 (X. luminescens (luxA and luxB)); 

10 M90093 (Xenorhabdus luminescens (luxA) and (luxB) (luxE)); U03687 
(Photinus pyralis modified luciferase gene). 

5) Glutathione S-transferase 
A glutathione S-transferase (GST), more preferably a Schistosoma 
japonicum glutathione S-transferase, can be included in the conjugate. 

15 GST occurs naturally as a 26 kDa protein which can be expressed in E. 
coli with full enzymatic activity. Conjugates that contain the full length 
GST also demonstrate GST enzymatic activity and can undergo 
dimerization as observed in nature (Parker et al., J. MoL B/oL, 213 :221 
(1990); Ji, et al.. Biochemistry, 31:10169 (1992); and Maru et al., J. 

20 fl/oA Chem., 271:15353 (1996)). The crystal structure of recombinant 
Schistosoma Japonicum GST from pGEX vectors has been determined 
(McTigue et al., J, Mo/. BioL, 246:21 (1995)) and matches that of the 
native protein. Conjugates that contain a GST can be readily purified. 
For example, fusion proteins are easily purified from bacterial 

25 lysates by affinity chromatography using Glutathione Sepharose 4B 

contained in the GST Purification Modules (Amersham Pharmacia Biotech, 
Inc.). Cleavage of the desired protein from GST is achieved using a site- 
specific protease whose recognition sequence is located immediately 
upstream from the multiple cloning site on the pGEX plasmids. Fusion 

30 proteins can be detected using a colorimetric assay or Immunoassay 
provided in the GST Detection Module, or by Western blotting with anti- 
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GST antibody. The system has been used successfully in many 
applications such as molecular immunology (Toye et al.. Infect Immun., 
58:3909 (1990)), the production of vaccines (Fikrig et al., Science, 
250:553 (1990); and Johnson et a!., Nature, 338:585 (1989)) and 
5 studies involving protein-protein (Kaelin et al., Cell, 64-521 (1991)) and 
DNA-protein (Kaelin et al., Cell, 65:1073 (1991)) interactions. 

Any glutathione S-transferase is contemplated. For example, the 
glutathione S-transferase encoded by nucleic acid with the following 
GenBank accession Nos. can be used: [AP1 12567], Fasciola gigantica; 

10 [M77682], Fasciola hepatica; [AB016426], Cavia porcellus; [AF1 44382], 
Arabidopsis thaliana; [AF133251], Gallus; [AB021655], Issatchenkia 
orientalis; [AF1 33268], Manduca sexta; [AF1 25273], Homo sapiens 
tissue-type skeletal muscle; [AF125271], Homo sapiens tissue-type 
pancreas; [AB026292], Sphingomonas paucimobilis; [AB026119], 

15 Oncorhynchus nerka; (U49179], Bos taurus; [AF1 06661], Rattus 
norvegicus (GstYb4); IL15387], Gallus class-alpha; [AF051318], 
Clonorchis sinensis; [AFi01269], Echinococcus granulosus; [AF0776091, 
Boophilus microplus; [AA9560871, Homo sapiens microsomal; 
[AF004358], Aegilops squarrosa; [AF109714], Triticum aestivum; 

20 [U86635], Rattus norvegicus glutathione; [AF111428], Drosophila- 
melanogaster microsomal; [AF1 1 1426], Drosophila melanogaster 
microsomal; [AF071163], Anopheles gambiae; [AF071162], Anopheles 
gambiae; [AF071161], Anopheles gambiae; [AF071160], Anopheles 
gambiae; [D10524], Nicotiana tabacum; [AF062403], Oryza sativa; 

25 [U77604], Homo sapiens microsomal (MGST2); [U30897], Human (Pib); 
[U62589], Human (GSTpIc); [U42463], Coccomyxa sp. PA; 
[AF001779], Sphingomonas paucimobilis strain epa505; IU51165], 
Cycloclasticus oligotrophus (XYLK); [AF025887], Homo sapiens 
(GSTA4); [U66342], Plutella xylostella; [AF051238], Picea mariana 

30 (Sb52); [AF051214], Picea mariana (Sb18); [AF07951 1], 

Mesembryanthemum crystallinum clone R6-R37; [D 10026], Rattus 
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norvegicus Yrs-Yrs; IAF048978], Glycine max 2,4-D inducible (GSTa); 
[AF043105), Homo sapiens IGSTM3); [AF057172], Homo sapiens 
(GSTT2P); IU216891, Human; [AH006027], Homo sapiens {GSTT2); 
[AF057176], Homo sapiens IGSTT2); [AF0501021, Oryza sativa (GST1); 
5 [AF044411], Schistosoma japonlcum; [U87958], Culicoides variipennis 
(CVGST1); [AF026977], Homo sapiens microsomal (jVIGSTS); 
[AF027740], Homo sapiens microsomal (MGST1L1); [AF005928], 
Ecliinococcus granulosus; [AF001103], Pseudomonas (phnC); 
tAF010241], Caenorhabditis elegans (CeGSTS); [AF010240], 

10 Caenorhabditis elegans (CeGST2); [AF010239], Caenorhabditis elegans 
(CeGSTI); [AF002692], Solanum commersonii (GST1); [L38503], Homo 
sapiens (GSTT2); [M97937I, E. coli/S. japonicium; [L29427], Rat GST-P 
gene; [Ml 4654], Schistosoma japonlcum Sj26 antigen; [AB000884], Sus 
scrofa; [D44465], Arabidopsis thaliana; [D1 7673], Arabidopsis thaliana; - 

1 5 [D1 7672], Arabidopsis thaliana; [U78784], Anopheles dirus; [U71 21 3], 
Human microsomal; [U70672], Arabidopsis thaliana; [U24428], Mus 
musculus; [U43126], Naegleria fowleri; [X14233], D.melanogaster (GST); 
[L32092], IN/Ianduca sexta; [L32091], Manduca sexta; [U304891, 
Arabidopsis thaliana; [M24889], Artificial rhaize; [L05915], Dianthus 

20 caryophyllus; [Ml 5872], Human; [L23766], OryctolagUs cuniculus; 

[J036791, Solanum tuberosum; [U12472], Human (GST phi); [U15654], 
Mus musculus; (M24485], Homo sapiens (GSTP1); [L28771], 
Onchocerca volvulus; [M14777], Human; [M16594], Human; [M21758], 
Human; [J039141, Rat; [KOI 932], Rat liver; [J02810], Rat prostate; 

25 [M25891J, Rat; [Ml 1719], Rat liver; [M28241], Rat; [J037521, Rat; 
(M73483], Mouse (GST Yc); [J046961, Mouse (GST5-5); [J04632], 
Mouse (QST1-1); [M59772], M.auratus; [L20466], Chinese hamster; 
[M256271, Human liver; [J03746], Human (SEQ ID No. 75); [Ml 6901], 
Maize; [M64268], Dianthus caryophyllus; [L11601], Arabidopsis thaliana; 

30 [L07589], Arabidopsis thaliana; [M74529], Oryctolagus cuniculus; 

[M74528], Oryctolagus cuniculus; [M98271], Schistosoma mansoni 28 
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kDa; [L23126], Lucilia cuprina; [M95198], Drosophila melanogaster; 
[L26544], Methylophilus sp.; [U14753], Dirofilaria immitus; [U12679], 
Zea mays; [L023211, Human (GSTM5); [LI 5386], Chicken. 

In addition, commercially available Glutathione S-transferase (GST) 
5 gene fusion system can be used. For example, the Glutathione S- 
transferase (GST) Gene Fusion System (Amersham Pharmacia Biotech, 
Inc.) can be used. The system from Amersham Pharmacia Biotech, Inc. is 
an integrated system for the expression, purification and detection of 
fusion proteins produced in £ coli. The system includes three primary 

10 components: pGEX plasmid vectors, various options for GST purification 
and a variety of GST detection products. A series of site-specific 
proteases complements the system. The pGEX plasmids are designed for 
inducible, high-level intracellular expression of genes or gene fragments 
as fusions with Schistosoma japonicum GST (Smith and Johnson, Gene, - 

15 67:31 (1988)). All pGEX Vectors (GST Gene fusion) offer: 1) A tac 

promoter for chemically inducible, high-level expression; 2) an internal lac 
P gene for use in any £. coli host; 3) very mild elution conditions for 
release of fusion proteins form the affinity matrix, thus minimizing effects 
on antigenicity and functional activity; and 4) PreScission, thrombin or 

20 factor Xa protease recognition sites for cleaving the desired protein from 
the fusion product. 

The GST Detection Module from Amersham Pharmacia Biotech, 
Inc. can be used for identification of GST fusion proteins using either a 
biochemical or immunological assay. In the biochemical assay, 

25 glutathione and 1 -chloro-2-4-dinitrobenzene (CDNB) serve as substrates 
for GST to yield a yellow product detectable at 340 nm (Habig et al., J. 
Biol Cliem., 249:7130 (1974)). An affinity-purified goat anti-GST 
polyclonal antibody suitable for Western blots is used in the 
immunoassay. 

30 The GST 96-Well Detection Module from Amersham Pharmacia 

Biotech, Inc. contains five microtiter strip plates, horseradish peroxidase 
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(HRP) conjugated anti-GST antibody and recombinant GST protein. The 
wells of each plate are coated with purified anti-GST antibody to capture 
GST fusion proteins and are preblocked to provide a low background. 
HRP conjugated antibody enables sensitive detection of GST proteins. 
5 The anti-GST antibody supplied in the system from Amersham 

Pharmacia Biotech, Inc. is a polyclonal antibody purified from the sera of 
goats immunized with purified schistosomal glutathione S-transferase 
(GST). Because of its polyclonal nature, it can recognize more than one 
epitope on GST, thereby improving its capacity for recognizing GST 
10 fusion proteins even if some binding sites are masked due to recombinant 
protein folding. 

Factor Xa can be used for site-specific separation of the GST 
affinity tag from proteins expressed using pGEX X vectors. Factor Xa 
enables the site-specific cleavage of fusion proteins containing an 

1 5 accessible Factor Xa recognition sequence. It can be used either 
following affinity purification or" while fusion proteins are bound to 
Glutathione Sepharose 4B. Factor Xa, purified from bovine plasma, is 
used to digest fusion proteins prepared from pGEX vectors containing the 
recognition sequence for factor Xa {pGEX-3X, pGEX-5X-1, pGEX-5X-2 

20 and pGEX-5X-3). It specifically cleaves following the tetrapeptide lle-Glu- 
Gly-Arg (SEQ ID No. 77) (Nagai and Thogersen, Nature, 309:810 (1984); 
and Nagai and Th0gersen, Methods Enzymol.. 153 :461 (1987)). In the 
system from Amersham Pharmacia Biotech, Inc., one unit of Factor Xa 
cleaves ^ 90% of 1(D0 /yg of a test GST fusion protein when incubated in 

25 1 mM CaClj, 100 mM NaCI and 50 mM Tris-HCI (pH 8.0) at 22^C for 16 
hours. 

PreScission protease can be used for site-specific separation of the 
GST affinity tag from proteins expressed using pGEX-6P vectors. It 
enables the low-temperature cleavage of fusion proteins containing the 
30 PreScission Protease recognition sequence. It can be used either 
following affinity purification or while fusion proteins are bound to 
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Glutathione Sepharose 4B. PreScission Protease is a genetically 
engineered fusion protein containing human rhinovirus 3C protease and 
GST (Walker et al., Bio/Technology, 12:601 (1 994)). This protease was 
specifically designed to facilitate removal of the protease by allowing 
5 simultaneous protease immobilization and cleavage of GST fusion 

proteins produced from pGEX-6P vectors (pGEX-6P-1, pGEX-6P-2, and 
pGEX-6P-3). PreScission Protease specifically cleaves between the Gin 
and Gly residues of the recognition sequence of 
LeuGluValLeuPheGln/GlyPro (SEQ ID No. 78) (Cordingley et al., J, Bio. 

10 Chem,, 265:9062 (1990)). In the system from Amersham Pharmacia 
Biotech, Inc., one unit of PreScission protease will cleaved 90% of 100 
tjq of a test GST-fusion protein in 50 mM Tris-HCI, 150 mM NaCI, 1 mM 
EDTA, 1 mM DTT, pH 7.0 at 5°C for 16 hours. 

Thrombin can be used for site-specific separation of the GST 

1 5 affinity tag from proteins expressed using pGEX T vectors. It enables the 
site-specific cleavage of fusion proteins containing an accessible thrombin 
recognition sequence. It is purified from bovine plasma; functionally free 
of other clotting factors, plasminogen and plasmin. It can be used either 
following affinity purification or while fusion proteins are bound to 

20 Glutathione Sepharose 48. Thrombin is used to digest fusion proteins 
prepared from pGEX vectors containing the recognition sequence for 
thrombin (pGEX-1/IT, pGEX-2T, pGEX-2TK, pGEX-4T-1, pGEX-4T2 and 
pGEX-4T-3), In the system from Amersham Pharmacia Biotech, Inc., one 
unit of Thrombin cleaves ^ 90% of 100 jC/g of a test GST fusion protein 

25 when incubated in 1x PBS at 22 °C for 16 hours. 

6) Defense proteins 
The conjugates can contain defense protein, such as an antibody. 
Any antibody, including polyclonal, monoclonal, single chain or Fab 
fragments, can be used. 



-225- 



wo 01/62968 



PCT/USOl/00452 



7) Fluorescent moieties 
The conjugates can contain a fluorescent moiety, such as a green, 
a blue or a red fluorescent protein. Any green, blue or red fluorescent 
protein can be used in the present system. For instance, the green 
5 fluorescent proteins encoded by nucleic acids with the following GenBank 
- accession Nos. can be used: U47949 (AGP1); U43284; AF007834 
(GFPuv); U89686 {Saccharomyces cerevisiae synthetic green fluorescent 
protein {cox3::GFPm-3) gene); U89685 (Saccharomyces cerevisiae 
synthetic green fluorescent protein (cox3::GFPm) gene); U87974 

10 (Synthetic construct modified green fluorescent protein GFP5-ER (mgfp5- 
ER)); U87973 (Synthetic construct modified green fluorescent protein 
GFP5 (mgfp5)); U87625 (Synthetic construct modified green fluorescent 
protein GFP-ER (mfgp4-ER)); U87624 (Synthetic construct green 
fluorescent protein (mgfp4) mRNA)); U73901 (Aequorea victoria mutant - 

15 3); U50963 (Synthetic); U70495 (soluble-modified green fluorescent 
protein (smGFP)); U57609 (enhanced green fluorescent protein gene); 
U57608 (enhanced green fluorescent protein gene); U57607 (enhanced 
green fluorescent protein gene); U57606 (enhanced green fluorescent 
protein gene); U55763 (enhanced green fluorescent protein (egfp); 

20 U55762 (enhanced green fluorescent protein (egfp); U55761 (enhanced 
green fluorescent protein (egfp); U54830 (Synthetic E. coli Tn3-derived 
transposon green fluorescent protein (GF); U36202; U36201; U19282; 
U19279; U19277; U19276; U19281; U19280; U19278; L29345 
(Aequorea victoria); M62654 (Aequorea victoria); I\/I62653 (Aequorea 

25 victoria); AAB47853 ((U87625) synthetic construct modified green 

fluorescent protein (GFP-ER)); AAB47852 ((U87624) synthetic construct 
green fluorescent protein). 

Similarly, the blue fluorescent proteins encoded by nucleic acids 
with the following GenBank accession Nos. can be used: U70497 

30 (soluble-modified blue fluorescent protein (smBFP); 1BFP (blue variant of 
green fluorescent protein); AAB16959 (soluble-modified blue fluorescent 
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protein). 

Also similarly, the red fluorescent proteins encoded by nucleic 
acids with the following GenBank accession Nos. can be used: U70496 
(soluble-modified red-shifted green fluorescent protein (smRSGFP); 
5 AAB16958 ({U70496) soluble-modified red-shifted green fluorescent 
protein). 

H. IMMOBILIZATION OF MUTANT DNA REPAIR ENZYMES AND 
NUCLEIC ACIDS 

10 In the methods for detecting abnormal base-pairings, mutations, 

and polymorphisms, and the methods for localizing and removing 
abnormal base-pairings described in Sections B-F, the target nucleic acid 
strand to be assayed, the reference nucleic acid strand, the target nucleic 
acid duplex to be assayed, the nucleic acid duplex formed via 

15 hybridization of the target strand and the reference strand, or the mutant - 
DNA repair enzyme or complex thereof can be immobilized on the surface 
of a support, either directly via a linker. Preferably, the support used is 
an insoluble support such as a silicon chip. Non-limiting examples of the 
geometry of the support include beads, pellets, disks, capillaries, hollow 

20 fibers, needles, solid fibers, random shapes, thin films, membranes and 
chips. Also more preferably, the nucleic acid strand, the nucleic acid 
duplex or the mutant DNA repair enzyme or complex thereof is 
immobilized in an array or a well format on the surface. 

1 . Immobilization of the mutant DNA repair enzymes 

25 In certain embodiments, where the facilitating agents are designed 

for linkage to surfaces, recovered, isolated or purified conjugates, such as 
* fusion proteins can be attached to a surface of a matrix material. 
Immobilization may be effected directly or via a linker. The conjugates 
may be immobilized on any suitable support, including, but are not limited 

30 to, silicon chips, and other supports described herein and known to those 
of skill in the art. A plurality of conjugates, which may contain the same 
or different or a variety of mutant DNA repair enzymes (abnormal base- 
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pairing trapping enzynnes) may be attached to a support, such as an array 
{i.e., a pattern of two or more) of conjugates on the surface of a silicon 
chip or other chip for use in high throughput protocols and formats. 

It is also noted that the mutant DNA repair enzymes can be linked 
5 directly to the surface or via a linlcer without a facilitating agent linked 
^ thereto. Hence, chips containing arrays of mutant DNA repair enzymes 
are contemplated. 

For example, an isolated or purified fusion protein can be attached 
to the surface as the intact fusion proteins. Alternatively, the protein or 

10 peptide fragment portion can be cleaved off and the mutant DNA repair 
enzyme be attached to the surface. The fusion protein can be cleaved by 
any methods known in the art such as chemical or enzymatic means. 
The cleavage means must be compatible with the linking sequence 
between the protein or peptide fragment portion and the mutant DNA 

1 5 repair enzyme so that the cleavage is linker sequence specific and the 
cleaved mutant enzyme is functional, i.e., can be used as a abnormal 
base-pairing-trapping enzyme. Those skilled in the art can readily 
determine, if necessary, with empirical studies, which cleavage/linker 
sequence pair to be used. Many cleavage/linker sequence pairs are well 

20 known in the art. For example, Factor Xa can be used for site-specific 
separation of the GST affinity tag from proteins expressed using pGEX X 
vectors; PreScission protease can be used for site-specific separation of 
the GST affinity tag from proteins expressed using pGEX-6P vectors; and 
Thrombin can be used for site-specific separation of the GST affinity tag 

25 from proteins expressed using pGEX T vectors. 

The matrix material substrates contemplated herein are generally 
insoluble materials used to immobilize ligands and other molecules, and 
are those that are used in many chemical syntheses and separations. 
Such substrates, also called matrices, are used, for example, in affinity 

30 chromatography, in the immobilization of biologically active materials, and 
during chemical syntheses of biomolecules, including proteins, amino 
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acids and other organic molecules and polymers. The preparation of and 
use of matrices is well known to those of skill in this art; there are many 
such materials and preparations thereof known. For example, naturally- 
occurring matrix materials, such as agarose and cellulose, may be isolated 
5 from their respective sources, and processed according to known 

protocols, and synthetic materials may be prepared in accord with known 
protocols. 

The substrate matrices are typically insoluble materials that are 
solid, porous, deformable, or hard, and have any required structure and 

10 geometry, including, but not limited to: beads, pellets, disks, capillaries, 
hollow fibers, needles, solid fibers, random shapes, thin films and 
membranes. Thus, the item may be fabricated from the matrix material 
or combined with it, such as by coating all or part of the surface or 
impregnating particles. 

1 5 Typically, when the matrix is particulate, the particles are at least 

about 10-2000 //M, but may be smaller or larger, depending upon the 
selected application. Selection of the matrices will be governed, at least 
in part, by their physical and chemical properties, such as solubility, 
functional groups, mechanical stability, surface area swelling propensity, 

20 hydrophobic or hydrophilic properties and intended use. 

If necessary, the support matrix material can be treated to contain 
an appropriate reactive moiety. In some cases, the support matrix 
material already containing the reactive moiety may be obtained 
commercially. The support matrix material containing the reactive moiety 

25 may thereby serve as the matrix support upon which molecules are 
linked. Materials containing reactive surface moieties such as amino 
silane linkages, hydroxyl linkages or carboxysilane linkages may be 
produced by well established surface chemistry techniques involving 
silanization reactions, or the like. Examples of these materials are those 

30 having surface silicon oxide moieties, covalently linked to gamma-amino- 
propylsilane, and other organic moieties; N-[3-(triethyoxysilyl)propyll- 
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phthelamic acid; and bis-(2-hydroxyethyl)aminopropyltriethoxysilane. 
Exemplary of readily available materials containing amino group reactive 
functionalities, include, but are not limited to, para-aminophenyl-. 
triethyoxysilane. Also derivatized polystyrenes and other such polymers 
5 are well known and readily available to those of skill in this art ( e.g. , the 
Tentagel® Resins are available with a multitude of functional groups, and 
are sold by Rapp Polymere, Tubingen, Germany; see, U.S. Patent No. 
4,908,405 and U.S. Patent No. 5,292,814; see, also Butz et al.. Peptide 
Res.. 7:20-23 (1994); and Kleine et al., ImmunobioL, 190 :53-66 (1994)). 

10 These matrix materials include any material that can act as a 

support matrix for attachment of the molecules of interest. Such 
materials are known to those of skill in this art, and include those that are 
used as a support matrix. These materials include, but are not limited to, 
inorganics, natural polymers, and synthetic polymers, including, but are 

15 not limited to: cellulose, cellulose derivatives, acrylic resins, glass, silica 
gels, polystyrene, gelatin, polyvinyl pyrrolidone, co-polymers of vinyl and 
acrylamide, polystyrene cross-linked with divinylbenzene and others (see, 
Merrifield, Biochemistry, 3:1385-1390 (1964)), polyacrylamides, latex 
gels, polystyrene, dextran, polyacrylamides, rubber, silicon, plastics, 

20 nitrocellulose, celluloses, natural sponges. Of particular interest herein, 
are highly porous glasses {see, e.g., U.S. Patent No. 4,244,721) and 
others prepared by mixing a borosilicate, alcohol and water. 

Synthetic matrices include, but are not limited to: acrylamides, 
dextran-derivatives and dextran co-polymers, agarose-polyacrylamide 

25 blends, other polymers and co-polymers with various functional groups, 
methacrylate derivatives and co-polymers, polystyrene and polystyrene 
copolymers {see, e.g., Merrifield, ff/oc/7em/sfr/, 3:1385-1390 (1964); 
Berg et al., in Innovation Perspect. Solid Phase Synth. Collect. Pap., Int, 
Symp., 1st, Epton, Roger (Ed), pp. 453-459 (1990); Berg et al., Pept, 

30 Proc. Eur. Pept. Symp., 20th, Jung, G. et ah (Eds), pp. 196-198 (1989); 
Berg et al., J. Am. Chem. Soc, 111:8024-8026 (1989); Kent et al., Isr. 
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J. Chem., 12:243-247 (1979); Kent et al., J. Org. Chem., 43:2845-2852 
(1978); Mitchell et al., Tetrahedron Lett. , 42:3795-3798 (197*6); U.S. 
Patent No. 4,507,230; U.S. Patent No, 4,006,1 17; and U.S. Patent No. 
5,389,449). Methods for preparation of such nnatrices are well-known to 
5 those of skill in this art. 

Synthetic matrices include those made from polymers and co-poly- 
mers such as polyvinylalcohols, acrylates and acrylic acids such as poly- 
ethylene-co-acrylic acid, polyethylene-co-methacrylic acid, polyethylene- 
co-ethylacrylate, polyethylene-co-methyl acrylate, polypropylene-co-acry- 

10 lie acid, polypropylene-co-methyl-acrylic acid, polypropylene-co-ethylacry- 
late, polypropylene-co-methyl acrylate, polyethylene-co-vinyl acetate, 
polypropylene-co-vinyl acetate, and those containing acid anhydride 
groups such as polyethylene-co-maleic anhydride, polypropylene-co- 
maleic anhydride and the like. Liposomes have also been used as solid 

15 supports for affinity purifications (Powell et al. Biotechnol. Bioeng., 
33:173 (1989)), 

For example, U.S. Patent No. 5,403,750, describes the preparation 
of polyurethane-based polymers. U.S. Pat. No. 4,241,537 describes a 
plant growth medium containing a hydrophilic polyurethane gel composi- 

20 tion prepared from chain-extended polyols; random copolymerization is 
preferred with up to 50% propylene oxide units so that the prepolymer 
will be a liquid at room temperature. U.S. Pat. No. 3,939,123 describes 
lightly crosslinked polyurethane polymers of isocyanate terminated 
prepolymers containing poly{ethyleneoxy) glycols with up to 35% of a 

25 poly(propyleneoxy) glycol or a poly(butyleneoxy) glycol. In producing 
these polymers, an organic polyamine is used as a crosslinking agent. 
Other matrices and preparation thereof are described in U.S. Patent Nos. 
, 4,177,038, 4,175,183, 4,439,585, 4,485,227, 4,569,981, 5,092,992, 
5,334,640, 5,328,603. 

30 U.S. Patent No. 4,162,355 describes a polymer suitable for use in 

affinity chromatography, which is a polymer of an aminimide and a vinyl 
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compound having at least one pendant halo-methyl group. An amine 
ligand, which affords sites for binding in affinity chromatography is 
coupled to the polymer by reaction with a portion of the pendant 
halo-methyl groups and the remainder of the pendant halo-methyl groups 
5 are reacted with an amine containing a pendant hydrophilic group. A 
method of coating a substrate with this polymer is also described. An 
exemplary aminimide is 1,1-dimethyl-1-(2-hydroxyoctyi)amine methacryl- 
imide and vinyl compound is a chloromethyl styrene. 

U.S. Patent No. 4,171,412 describes specific matrices based on 

10 hydrophilic polymeric gels, preferably of a macroporous character, which 
carry covalently bonded D-amIno acids or peptides that contain D-amino 
acid units. The basic support is prepared by copolymerization of 
hydroxyalkyi esters or hydroxyaikylamides of acrylic and methacrylic acid 
with crosslinking acrylate or methacrylate comonomers are modified by 

15 the reaction with diamines, aminoacids or dicarboxylic acids and the 
resulting carboxyterminal or aminoterminal groups are condensed with 
D-analogs of aminoacids or peptides. The peptide containing D-amino- 
acids also can be synthesized stepwise on the surface of the carrier. For 
example, U.S. Patent No, 4,178,439 describes a cationic ion exchanger 

20 and a method for preparation thereof. U.S. Patent No. 4,180,524 
describes chemical syntheses on a silica support. 

The fusion protein can be attached to the surface of the nnatrix 
material by methods known in the art. Numerous methods have been 
developed for the immobilization of proteins and other biomolecules onto 

25 solid or liquid supports {see, e.g., Mosbach, Methods in Enzymology, 44 
(1976); Weetall, Immobilized Enzymes, Antigens, Antibodies, and 
Peptides, (1975); Kennedy et al., Solid P/iase Biochemistry, Analytical 
and Synthetic Aspects, Scouten, ed., pp. 253-391 (1983); see, 
generally, Affinity Techniques. Enzyme Purification: Part B, Methods in 

30 Enzymology, Vol. 34, ed. W. B*. Jakoby, M. Wilchek, Acad. Press, N.Y. 
(1974); and Immobilized Biochemicals and Affinity Chromatography, 
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Advances in Experimental Medicine and Biology, vol. 42, ed. R. Dunlap, 
Plenum Press, N.Y. (1974)). 

Among the most commonly used methods are absorption and ad- 
sorption or covalent binding to the support, either directly or via a linker, 
5 such as the numerous disulfide linkages, thioether bonds, hindered 
disulfide bonds, and covalent bonds between free reactive groups, such 
as amine and thiol groups, known to those of skill in art (see, e.g., the 
PIERCE CATALOG, ImmunoTechnology Catalog & Handbook, 1992- 
1 993, which describes the preparation* of and use of such reagents and 

10 provides a commercial source for such reagents; Wong, Ctiemistry of 
Protein Conjugation and Cross Linking, CRC Press (1 993); see also 
DeWitt et aL, Proc. NatL Acad. Sci. U.S.A., 90:6909 (1993); 
Zuckermann et al., J. Am. Chem. Soc, 114 :10646 (1992); Kurth et al., 
J, Am. Chem. Soc, 116:2661 (1994); Ellman et aL, Proc. NatL Acad. 

15 ScL U.S.A., 91:4708 (1994); Sucholeiki, Tetrahedron Lttrs., 35:7307 
(1994); Su-Sun Wang, J. Org. Chem., 41:3258 (1976); Padwa et aL, J. 
Org. Chem., 41:3550 (1971); and Vedejs et al., J. Org. Chem., 49:575 
(1984), which describe photosensitive linkers). 

To effect immobilization, a composition containing the protein or 

20 other biomolecule is contacted with a support material such as alumina, 
carbon, an ion-exchange resin, cellulose, glass or a ceramic. 
Fluorocarbon polymers have been used as supports to which 
biomolecules have been attached by adsorption (see, U.S. Patent 
No. 3,843,443; Published International PCT Application WO 86/03840). 

25 A large variety of methods are known for attaching biological 

molecules, including proteins and nucleic acids, molecules to solid 
supports (see e.g., U.S. Patent No. 5451683). For example, U.S. Pat. 
No. 4,681,870 describes a method for introducing free amino or carboxyl 
groups onto a silica matrix. These groups may subsequently be 

30 covalently linked to other groups, such as a protein or other anti-ligand, in 
the presence of a carbodiimide. Alternatively, a silica matrix may be 
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activated by treatment with a cyanogen halide under alkaline conditions. 
The anti-ligand is covalently attached to the surface upon addition to the 
activated surface. Another method involves modification of a polymer 
surface through the successive application of multiple layers of biotin, 
5 avidin and extenders {see e.g., U.S. Patent No. 4,282,287). Other 

methods involve photoactivation in which a polypeptide chain is attached 
to a solid substrate by incorporating a light-sensitive unnatural amino acid 
group into the polypeptide chain and exposing the product to low-energy 
ultraviolet light {see e.g., U.S. Patent No. 4,762,881). Oligonucleotides 

10 have also been attached using a photochemically active reagent, such as 
a psoralen compound, and a coupling agent, which attaches the 
photoreagent to the substrate {see e.g., U.S. Patent No, 4,542,102 and 
U.S. Patent No, 4,562,157). Photoactivation of the photoreagent binds a 
nucleic acid molecule to the substrate to give a surface-bound probe. 

1 5 Covalent binding of the protein or other biomolecule or organic 

molecule or biological particle to chemically activated solid matrix 
supports such as glass, synthetic polymers, and cross-linked 
polysaccharides is a more frequently used immobilization technique. The 
molecule or biological particle may be directly linked to the matrix support 

20 or linked via linker, such as a metal {see, e.g., U.S. Patent No. 

4,179,402; and Smith et al., Methods: A Companion to Methods in Enz., 
^.IZ-IQ (1992)). An example of this method is the cyanogen bromide 
activation of polysaccharide supports, such as agarose. The use of 
perfiuorocarbon polymer-based supports for enzyme immobilization and 

25 affinity chromatography is described in U.S. Pat. No. 4,885,250. In this 
method the biomolecule Is first modified by reaction with a perfluoroalkyt- 
ating agent such as perfluorooctylpropylisocyanate described in U.S. Pat. 
No. 4,954,444. Then, the modified protein Is adsorbed onto the fluoro- 
carbon support to effect Immobilization. 
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The activation and use of matrices are well known and may be 
effected by any such known methods [see, e.g., Hermanson et a!., 
Immobilized Affinity Ligand Techniques, Academic Press, Inc., San Diego 
(1 992)). For example, the coupling of the amino acids may be 
5 accomplished by techniques familiar to those in the art and provided, for 
example,^ in Stewart and Young, Solid Phase Synthesis, Second Edition, 
Pierce Chemical Co., Rockford (1984). 

Other suitable methods for linking molecules to solid supports are 
well known to those of skill in this art {see, e.g., U.S. Patent No. 

10 5,416,193). These include linkers that are suitable for chemically linking 
molecules, such as proteins, to supports and include, but are not limited 
to, disulfide bonds, thioether bonds, hindered disulfide bonds, and 
covalent bonds between free reactive groups, such as amine and thiol 
groups. These bonds can be produced using heterobifunctional reagents 

15 to produce reactive thiol groups on one or both of the moieties and then 
reacting the thiol groups on one moiety with reactive thiol groups or 
amine groups to which reactive maleimido groups or thiol groups can be 
attached on the other. 

Other linkers include, acid cleavable linkers, such as 

20 bismaleimideothoxy propane, acid labile-transferrin conjugates and adipic 
acid diihydrazide, that would be cleaved in more acidic intracellular 
compartments; cross linkers that are cleaved upon exposure to UV or 
visible light and linkers, such as the various domains, such as ChI, 0^2, 
and Ch3, from the constant region of human IgG^ (Batra et al., Molecular 

25 Immunol., 30:379-386 (1993)). Presently preferred linkages are direct 
linkages effected by adsorbing the molecule to the surface of the matrix. 

Other linkages are photocleavable linkages that can be activated by 
exposure to light (see, e.g., Goldmacher et al., Bioconj. Chem., 3:104- 
107 (1992)). The photocleavable linker is selected such that the cleaving 

30 wavelength does not damage linked moieties. Photocleavable linkers are 
linkers that are cleaved upon exposure to light (see, e.g., Hazum et al., 
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Pept, Proc. Eur. Pept Symp,, 16th, Brunfeldt, K (Ed),.pp, 105-110 

(1981), which describes the use of a nitrobenzyl group as a 

photocleavable protective group for cysteine; Yen et al., Makromol. 

Chem., 190 :69-82 (1989), which describes water soluble photocleavable 
5 copolymers, including hydroxypropylmethacrylamide copolymer, glycine 

copolymer, fluorescein copolymer and methylrhodamine copolymer; 

Goldmacher et a!., Bioconj\ Chem., 3:104-107 (1992), which describes a 

cross-linker and reagent that undergoes photolytic degradation upon 

exposure to near UV light (350 nm); and Senter et al., Photochem. 
10 PhotobioL, 42:231-237 (1985), which describes nitrobenzyloxycarbonyl 

chloride cross linking reagents that produce photocleavable linkages). 

The selected linker will depend upon the particular application and, if 

needed, may be empirically selected. 

In a preferred embodiment, the recovered fusion protein is attached " 
1 5 to the surface through affinity binding between the protein or peptide 

fragment of the fusion protein and an affinity binding moiety on the 

surface. 

2. Immobilization of nucleic acids 

The target nucleic acid strand to be assayed, the reference nucleic 
20 acid strand, the target nucleic acid duplex to be assayed, the nucleic acid 
duplex formed via hybridization of the target strand and the referisnce 
strand can be immobilized by any methods known in the art. For 
example, the immobilization procedures disclosed in the following 
literatures can be used: Bresser et al., DMA, 2121:243-54 (1983); 
25 HIrayama et aL, Nucleic Acids Res., 241201:4098-9 (1 996); Kremsky et 
al.. Nucleic Acids Res., 15121:2891-909 (1987); Macdougall et aL, 
Biochem. J., 191(3) :855-8 (1980); Mykoniatis, J. Biochem. Biophys. 
Methods, 10(5-6) :321-8 (1985); Nagasawa et aL, J. Appl. Biochem., 
7(4-5) :296-302 (1985); Nikiforov and Rogers, Anal. Biochem., 
30 227111:201-9 (1 995); Proudnikov et aL, Anal. Biochem., 259(1 ) :34-41 
(1998); Rasmussen et aL, Anal. Biochem., 198(1) :138-42 (1991); and 
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Rogers et aL, Anal. Biochem., 266(1 ) :23-30 (1999). 

Bresser et al., DNA, 2131:243-54 (1983) discloses a method for 
selectively immobilizing either mRNA or DNA on nitrocellulose. Essential 
elements of the procedure for immobilizing DNA include tissue lysis, 
5 proteinase K treatment, solubilization of nucleic acids in hot 12.2 molal 
Nal, passage through a nitrocellulose filter, and acetylation of residual 
protein with acetic anhydride. Advantages include speed, quantitative 
recovery, low background, and elimination of the usual baking step. 
Essential elements of the procedure for selectively immobilizing mRNA 

10 include dissolving cells in Brij-35 and desoxycholate, proteinase K 

treatment solubilizing nucleic acids in room temperature 12.2 molal Nal, 
filtration through nitrocellulose, and acetylation of residual protein. 
Advantages include selective immobilization of mRNA but not tRNA, 
rRNA, or DNA, and the maintenance of biological activity of the 

15 immobilized mRNA. 

Hirayama et al., Nucleic Acids Res., 24(20) :4098-9 (1996) 
discloses an improved and simplified protocol for DNA immobilization to 
enhance DNA-DNA hybridization on microwell plates. Target DNA was 
immobilized by simple dry-adsorption. Efficiencies of DNA immobilization 

20 and retention were enhanced 1 .4-6.5 times and 4,2-1 9.6 times, 
respectively, compared with a conventional method. The overall 
hybridization efficiency was increased 3.1-5.2 times. This simple new 
protocol can reduce the consumption of scarce DNA samples. 

Kremsky et a!.. Nucleic Acids Res., 15(7) :2891-909 (1987) 

25 discloses a general method for the immobilization of DNA through its 
5'-end has. A synthetic oligonucleotide, modified at its 5'-end with an 
aldehyde or carboxylic acid, was attached to latex microspheres 
containing hydrazide residues. Using T4 polynucleotide ligase and an 
oligonucleotide splint, a single stranded 98mer was efficiently joined to 

30 the immobilized synthetic fragment. After impregnation of the latex 
microspheres with the fluorescent dye, Nile Red and attachment of an 
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aldehyde 1 6mer, 5X10^ bead-DNA conjugates could be detected with a 
conventional fluorimeter. 

Macdougall et al., Biochem. J., 191(3) :855-8 (1980) discloses a 
method in which double-stranded DNA is alkylated with 
5 4-bis-{2-chloroethyl)amino-L-phenylalanineand the product immobilized 
on an insoluble support via the primary amino group of the phenylalanine 
moiety. The DNA is irreversibly bound to the matrix by both strands at a 
limited number of points. 

Mykoniatis, J. Biochem, Biophy$. Methods, 10(5-6) :321-8 (1985) 

10 discloses a method for the immobilization of DNA on Sephadex G200 in 
the presence of water soluble carbodiimide. An increase in the extent of 
binding was observed when the incubation temperature of the 
DNA-Sephadex mixture was changed. It was found that native DNA 
immobilized to Sephadex with higher efficiency than denatured DNA. The - 

1 5 stability of native DNA-Sephadex complex was about the same as that of 
denatured DNA-Sephadex. The size of DNA released by DNA-Sephadex 
after incubation of a suspension of the complex was the same as that of 
the DNA used for immobilization. 

Nagasawa et al., J. AppL Biochem., 7(4-5) :296-302 (1985) 

20 discloses a method in which DNA was immobilized covalently to 

Sepharose by several methods using epichlorohydrin, cyanogen bromide, 
carbodiimide, hydroxysuccinimide, carbonyldiimidazole, trichlorotriazine, 
and diazonium salt. These immobilizing methods were compared from 
the standpoint of the preparation of immunosorbent for anti-DNA 

25 antibodies. Among these methods, that involving epichiorohydrin was 
the most suitable because of large coupling capacity, stability of bound 
DNA, and nonadsorption of anti-DNA by the support itself. 

Nikiforov and Rogers, Anal. Biochem., 227(1) :201-9 (1995) 
discloses 3 methods for the immobilization of relatively short (12-30 mer) 

30 oligonucleotide probes to 96-well polystyrene plates for use in DNA 
hybridization-based assays. Two of the methods are modifications of 
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previously published procedures, requiring the use of modified 
oligonucleotides and/or modified plates. These were compared to a 
newly developed method, whereby passive immobilization occurs by 
incubation in the presence of salt or a cationic detergent. While all 

. 5 methods resulted in the productive binding of the DNA probes and could 
therefore be used for hybridization, only the passive immobilization 
approach met strict performance criteria for use in DNA genotyping. 

Proudnikov et a!., AnaL Biochem., 259(1 ) :34-41 (1998) discloses 
immobilization of DNA in polyacrylamide gel for the manufacture of DNA 

10 and DNA-oligonucleotide microchips. Activated DNA was immobilized in 
aldehyde-containing polyacrylamide gel for use in manufacturing the 
MAGIChip (microarrays of gel-immobilized compounds on a chip). First, 
abasic sites were generated in DNA by partial acidic depurination. Amino 
groups were then introduced into the abasic sites by reaction with 

1 5 ethylenediamine and reduction of the aldimine bonds formed. It was 
found that DNA could be fragmented at the site of amino group 
incorporation or preserved mostly unfragmented. In similar reactions, 
amino-DNA and amino-oligonucleotides were attached through their 
amines to polyacrylamide gel derivatized with aldehyde groups. Single- 

20 and double-stranded DNA of 40 to 972 nucleotides or base pairs were 
immobilized on the gel pads to manufacture a DNA microchip. The 
microchip was hybridized with fluorescently labeled DNA-specific 
oligonucleotide probes. This procedure for immobilization of amino 
compounds was used to manufacture MAGIChips containing DNA and 

25 oligonucleotides. 

Rasmussen et al., Anal. Biochem., 1 98(1 ) :1 38-42 (1991) discloses 
covalent immobilization of DNA onto polystyrene microwells via the 
DNA's 5' end. DNA is bound onto the microwells by formation of a 
phosphoramidate bopd between the 5' terminal phosphate group and the 

30 microwells. Immobilization of 25 to 30 ng DNA per well is obtained. 
DNA molecules bound covalently at only the 5' end are, ideally, perfect 

-239- 



wo 01/62968 



PCT/USOl/00452 



for hybridization. 

Rogers et ai., Anal. Biochem., 266111:23-30 (1999) discloses 
immobilization of oligonucleotides onto a glass support via disulfide bonds 
in preparation of DNA microarrays. This method provides an efficient and 
5 specific covalent attachment chemistry for immobilization of DNA probes 
onto a solid support. Glass slides were derivatized with 3-mercaptopropyl 
silane for attachment of 5-prime disulfide-modified oligonucleotides via 
disulfide bonds. An attachment density of approximately 3x10^ 
oligonucleotides/micron^ was observed. Oligonucleotides attached by this 
10 method provided a highly efficient substrate for nucleic acid hybridization 
and primer extension assays. In addition, patterning of multiple DNA 
probes on a glass surface utilizing this attachment chemistry has been 
demonstrated, which allows for array densities of at least 20,000 
spots/cm^. 

15 1. HIGH-THROUGHPUT ASSAY FORMAT 

Although the methods for detecting abnormal base-pairing, 
mutations or polymorphisms, or methods for removing or localizing such 
abnormal base-pairing described in Sections B-F can be used wherein a 
single sample is assayed in one assay, the assay is preferably conducted 

20 in a high throughput mode, i.e,, a plurality of the abnormal base-pairing, 
mutations or polymorphisms are detected, localized and/or removed 
simultaneously (See generally, High Throughput Screening: The Discovery 
of Bioactive Substances (Devlin, Ed.) Marcel Dekker, 1997; Sittampalam 
et al., Curr. Opin. Cham. BloL, 1(3) :384-91 (1997); and Silverman et al., 

25 Curr. Opin. Chem. Biol., 2(3) :397-403 (1998)). For example, the assay 
can be conducted in a multi-well (e.g., 24-, 48-, 96-, or 384-well), chip 
or array format. 

Current state-of-the-art high-throughput assay operations are 
highly automated and computerized to handle sample preparation, assay 

30 procedures and the subsequent processing of large volumes of data. 
Each one of these steps requires careful optimization to operate 
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efficiently and can assay 100-300,000 samples in a 2-6 month period. 
Hence, a modern high-throughput assay operation is a multidisciplinary 
field involving analytical chemistry, biology, biochemistry, synthesis 
chemistry, molecular biology, automation engineering and computer 
5 science (Fernandes, J, Biomol. Screening, 2:1 (1997)), 

1 . High-throughput assay instrumentation and capabilities 
In general, the instrumentation used in high-throughput assays 

should be accurate, reliable and easily amenable to automation. 
Analytical methods should be robust and reproducible, with stable 
10 reagents and signal responses. Signal-to-noise (S/N) ratios should be 
large enough to generate signal windows (Sittampalam et aL, J. Biomol. 
Screening, 2:159-169 (1997)) that allow reliable detection of "hits". 

2. Detection technologies 

Detection tischnologies employed in high-throughput screens 
1 5 depend on the type of biochemical pathway being investigated 

(Sittampalam et al., Curr, Opin. Chen). Biol., 1(3):384-91 (1997)). 
a. Radiochemical methods 
Although filtration-based receptor binding assays have been used 

extensively in the past (to separate the bound and free radiolabeled 
20 ligand), the scintillation proximity assay (SPA) has become the standard 

assay in many HTS operations, mainly because it does not require a 

separation step, and can be easily automated (Braunwalder et al., J. 

Biomol. Screening, 1:23-26 (1996); Cole, IVIethods EnzymoL, 275:310- 

328 (1996); Cook, Drug Discov. Tech., 1:287-294 (1996); Kahl et al., J. 
25 Biomol. Screening, 2:33-40 (1997); Lerner et al., J. Biomol. Screening, 

1:135-143 (1996); Baker et al., Anal. Biochem., 239:20-24 (1996); 

Baum et al.. Anal. Biochem., 237:129-134 (1996); Sullivan et al„ J. 

Biomol. Screening, 2:19-23 (1997); De Serres et al.. Anal. Biochem., 

233:228-233 (1996); Sonatore et al.. Anal. Bigchem., 240:289-297 
30 (1996); Chen et al., J. Biol. Chem., 271:25308-25315 (1996); Patel et 

al., Biochem. Biophys. Res. Commun., 221:821-825 (1996); and Fox, 
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Pharm. Forum, 6:1-3 (1996)). SPA can also be easily adapted to a 
variety of enzyme assays (Lerner et al., J. BiorrioL Screening, 1:135-143 
(1996); Baker et al., Anal. Biochem., 239 :20-24 (1996); Baum et al., 
AnaL Biochem., 237 :129-134 (1996); and Sullivan et a!., J. BiomoL 
5 Screening, 2:19-23 (1997)) and protein-protein interaction assays 

(Braunwalder et al., J. BiomoL Screening, 1:23-26 (1996); Sonatore et 
al., AnaL Biocliem,, 240:289-297 (1996); and Chen et al., J. Biol. 
Chem., 271:25308-25315 (1996)). 

One version of SPA utilizes polyvinyltoluene (PVT) microspheres or 

10 beads (-5 //m diameter, density -1.05 g/cm^) into which a scintillant 
has been incorporated (Hook, Drug Discov. Tech., 1:287-294 (1996)). 
When a radiolabeled ligand is captured on the surface of the bead, the 
radioactive decay occurs in close proximity to the bead, and effectively 
transfers energy to the scintillant, which results in light emission. When 

15 the radiolabel is displaced or inhibited from binding to the bead, it remains 
free in solution and is too distant from the scintillant for efficient energy 
transfer. Energy from radioactive decay is dissipated into the solution, 
which results in no light emission from the beads. Hence, the bound and 
free radiolabel can be detected without the physical separation required in 

20 filtration assays. 

The ideal isotopes for labeling ligands used in SPA assays are ^H 
and ^^^1. This is because the ^ particles from ^H have a relatively short 
pathlength, about 1.5 /jM, which easily fulfills the distance requirement 
for SPA. The Auger electrons emitted by ^^^1, which travel between 

25 approximately ^^\x^ and 17,6A/m in aqueous media, also satisfy this 
distance requirement. 

SPA can also be carried out in scintillating microplates 
(Braunwalder et al., J. BiomoL Screening, 1:23-26 (1996); Fox, Pharm. 
Forum, 6:1-3 (1996); and Harris et al., AnaL Biochem., 243 :249-256 

30 (1996)), in which the scintillant is directly incorporated into the plastic, or 
is coated on the inner surface of the wells. These plates are 
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commercially available. For example, Flashplate® is from NEN^** Life 
Science Products (Boston, MA) in which the scintillant is coated on the 
inner surface of the wells. The Scinitstrip® plate is from WallacOy 
(Turku, Finland) which is made by incorporating the scintillant into the 
5 entire plastic, A more recent development is the Cytostar-T™ (Amerisham 
Life Sciences, Cardiff, Wales) scintillating microplates (Fox, Pharm. 
Forum, 6:1-3 (1996) which were specially designed for cell-based 
proximity assays. Scintillant is incorporated into the base plate of 
microtiter plates and pan also detect additional isotopes such as ^*C, 

10 ^^Ca, ^^S, and ^^P. 

b. Non-isotopic detection methods 

1 ) Colorimetry and luminescence 
Colorimetric and luminescence detection methods have significant 
advantages for HTS laboratories, particularly in light of the cost, safety 

1 5 and disposal issues associates with radiochemical methods. Since 

luminescence methods can be as sensitive as radioactive methods, with 
low detection limits, these techniques are being used increasingly in HTS 
assays (Brown et al., Curn Opin. Biotechnol., 8:45-49 (1997); Glazer, 
BioRadiations, 98:4-8 (1997); Czarnik, Chem, BioL, 2:423-428 (1995); 

20 Wang et al., Tetrahedron Lett,, 31:6493-6496 (1991); Mathis, Clin, 

Chem., 41:1391-1397 (1995); Kolb et al., J. Biomol. Screening, 1:203- 
210 (1996); Gonzalez et al., Biophys. J., 69:1272-1280 (1995); 
Schroeder et al., J. Biomol. Screening, 1:75-80 (1996); Waggoner et al.. 
Hum. Pathol: , 27:494-502 (1996); Jameson et al., Methods Enzymol., 

25 246:283-300 (1995); Lundblad et al.. Mot. Endocrinol., 10:607-612 
(1996); Checovich et al., Nature, 375:254-256 (1995); Levine et al., 
Anal. Biochem., 247 :83-88 (1997); Jolley, J. Biomol. Screening, 1:33-38 
(1996); Schade et al., Anal. Biochem., 243:1-7 (1996); Lynch et aL, 
Anal. Biochem., 247:77-82 (1997); Starrer et aL, J. Recept. Signal 

30 Transduct Res. , 17:5 1 1 -520 ( 1 997); Rigler, J. Biotechnol. , 41: 1 77-1 86 
(1995); Rauer et aL, Biophys. Chem., 58:3-12 (1996); Sarubbi et aL, 
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Anat. Biochem., 237:70-75 (1996); Rose et al., Network Science, 2(9): 1- 
12 (1996); Dhundale et al., J. BiomoL Screening, 1:1 15-118 (1996); 
Suto et al., J. Biomol. Screening, 2:7-9 (1997); Bronstein et al.. Anal, 
Biochem., 2^9:^69-^8^ (1994); Hastings, Gene, 173:5-11 (1996); Lehel 
5 et al., AnaL Biocliem., 244:340-346 (1997); Kolb et al., J. Biomol. 
Screening, 1:85-88 (1996); Bran et al., J. Biomol. Screening, 1:43-45 
(1996); Rizzuto et al., Curr Biol,, 6:183-188 (1996)). Glazer (Glazer, 
BioRadiations, 98:4-8 (1997)) and Czarnik (Czarnik, Ctiem. BioL, 2:423- 
428 (1995)) and the Fluorescent Chemosensors and Biosensors Database 

10 on the World Wide Web URL; http://biomednet.com/fluoro/ have reviewed 
the utility and need for fluorescence-based techniques for biological 
applications, which can be easily extended to HTS assays. 

3} Resonance energy transfer 
Resonance energy transfer (RET) between a fluorophore and 

1 5 chromophore was one of the earliest methods developed for HTS. For 
example, a peptide substrate for an HIV protease was synthesized with 
EDANS (as the amino terminus) as the donor fluorophore, and DABCYL 
(at the carboxyl terminus) as the acceptor chromophore (Wang et al., 
Tetratiedron Lett., 31:6493-6496 (1991)), Energy transfer from EDANS 

20 to DABCYL in the intact peptide resulted in quenching of EDANS 
fluorescence, 

3) . Time-resolved fluorescence 

A new homogeneous time-resolved fluorescence (HTRF) 
technology has been described (Mathis, Clin. Chem., 41:1391-1397 

25 (1995)). The assay utilizes fluorescence energy transfer between two 
fluorophores (an europium cryptate and a 1 05kDa phycobiliprotein, 
allophycocyanin) as labels. The Eu-trisbipyridine cryptate (TBR-EU^**", A^^ 
= 337 rim) has two bipyridyl groups that harvest light and channel it to 
the caged Eu^**". It has a long fluorescence, lifetime and nonradioactively 

30 transfers the energy to allophycocyanin when the two labels are in close 
proximity {>50% transfer efficiency at a donor-acceptor distance of 9.5 
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nm). The resulting fluorescence of allophycocyanin yl^^ = 665 nm) 
retains the long lifetime of the donor TBP)-EU^*, allowing time-resolved 
measurement. These labels and their spectroscopic characteristics are 
very stable in biological media. 
5 4) Cell-based fluorescence assays 

An interesting fluorescence resonance energy transfer (FRET) 
procedure for sensing voltage across cell membranes has been described 
recently (Gonzalez et al., Biophys. J., 69:1272-1280 (1995)). The 
technique uses membrane permeable, anionic oxonols which rapidly 

10 locate on the inner or outer membrane surface depending on polarization 
state of the membrane. FRET occurs between fluorescein-labeled WGA 
and the oxonols bound to the other surface of the membrane at a resting 
negative potential. As a positive potential, the oxonols are relocated to 
the inner membrane surface, and the FRET is greatly reduced. 

1 5 Many fluorescence intensity measurements, including FRET, can be 

configured on a instruments specifically designed for cell-based HTS 
assays in 96-well or higher density plates called FLIPr (Schroeder et al., 
J. Biomol. Screening, 1:75-80 (1996)]. FLIPR utilizes a water-cooled 
argon ion laser (5 watt) or a xenon are lamp and a semiconfocal optical 

20 system with a charge-coupled device (CCD) camera to illuminate and 

image the entire plate. The spatial resolution of the optics is -200 //m at 
the cell plane. The plate chamber temperature can be controlled 
precisely, and a 96-well pipettor head is integrated into the instrument. 
These features allow accurate measurements of cellular' biochemistry in 

25 confluent layers of cells at the bottom of plates. FLIPR software can 
rapidly quantify transient fluorescence signals in intact cells that are 
growing attached to the bottom of the well. HTS assays involving 
intracellular calcium, pH and membrane potential measurements have 
been designed using this instrument (Waggoner et al.. Hum, Pathol., 

30 27:494-502(1996)). 
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5) Fluorescence polarization 

Another technique that has gained popularity recently is 
fluorescence polarization or anisotropy (Jameson et a\., Methods 
EnzymoL, 246:283-300 (1995); Lundblad et aL, MoL Endocrinol., 
5 10:607-612 (1996); Checovich et al.. Nature, 375:254-256 (1995); 
Levine et al., AnaL Biochem,, 247 :83-88 (1997); Jolley, J. BionioL 
Screening, 1:33-38 (1996); Schade et al., AnaL Biochem., 243:1-7 
(1996); Lynch et al., Anai, Biocfiem., 247:77-82 (1997)). When 
fluorescently labeled molecules in solution are illuminated with plane- 

•1 0 polarized light, the emitted fluorescence will be in the same plane 

provided the molecules remain stationary. Since all molecules tumble as 
a result of collisional motion, depolarization phenomenon is proportional 
to the rotational relaxation time (jj) of the molecule, which is defined by 
the expression 3/7V/RT. At constant viscosity {rj) and temperature (T) of 

1 5 the solution, polarization is directly proportional to the molecular volume 
(V) (R is the universal gas constant). Hence, changes in molecular 
volume or molecular weight due to binding interactions can be detected 
as a change in polarization. For example, the binding of a fluorescently 
labeled ligand to its receptor will result in significant changes in measured 

20 fluorescence polarization values for the ligand. Once again, the 
measurements can be made in a "mix and measure" mode without 
physical separation of the bound and free ligands. The polarization 
measurements are relatively insensitive to fluctuations in fluorescence 
intensity when working in solutions with moderate optical intensity. 

25 6) Fluorescence correlation spectroscopy 

Fluorescence correlation spectroscopy (FCS) has been recently 
described for HTS applications (Starrer et al., J. Recept. Signal Transduct 
Res., 17:511-520 (1997); Rigler, J. BiotechnoL, 41:177-186 (1995); 
Rauer et al., Biophys. Cliem., 58:3-12 (1996)). FCS measures time- 

30 dependent and spontaneous fluctuations in fluorescence intensities in 
very small volumes (nanoliters). These fluctuations usually result from 
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Brownian motion associated with chemical reactions, diffusion or the 
flow of fluorescently labeled molecules. The average fluctuation is 
proportional to the square foot of N, where N is the average number of 
molecules in the volume. Since Brownian diffusion is directly affected by 
5 molecular interactions, FCS is an excellent tool to measure binding 
interactions (Brown et al,, Cum Opin. Biotechnol. , 8:45-49 (1997)). 
Using powerful lasers and autocorrelation techniques, sensitive 
measurements (at concentrations of ---lO-^^M) can be made in solution 
and in cellular compartments. 

10 3. Miniaturization 

Several factors ar^ fueling efforts to increase the speed of HTS and 
decrease the volume of individual reactions within an HTS format 
(Silverman et al., Cum Opin. Chem. Biol., 2131:397-403 (1998)). Split- 
bead synthesis, or other similar approaches to combinatorial chemistry, 

15 dramatically increases the number of compounds that can be produced in 
a library but do so at the cost of quantity of material. 

One approach involves reducing the well size and increasing the 
density of the assay plate but retaining the overall assay format used in 
current 96-well based HTS. Densities of 6,500 assays in a 10 cm array 

20 have been reported to cell-free enzyme based assays (Schuliek et al., 
AnaL Biochem., 246 :20-29 (1997)) and for ligand binding in cell based 
assays (You et al., Chem. Biol., 4:969-975 (1997)). This approach of 
miniaturizing existing formats significantly increases the number of assays 
per plate and the overall throughput of the screen but is intrinsically 

25 limited by the physical constraints of delivering small volumes to wells, 
and of detecting responses in a sensitive and timely manner. Another 
approach uses glass chips containing microchannels in which reagents, 
target proteins and compounds are herded by electrokinetic flow 
controlled by electric potentials applied at the ends Of the channels (Hadd 

30 et al.. Anal. Chem., 69:3407-3412 (1997)). A related approach attains 
high-throughput of chemical synthesis and activity assessment by parallel 
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arrays of three-dimensional channels in which flow is controlled by 
miniature hydrostatic actuators (Rogers, Drug Discov. Today, 2:306 
(1997)). These approaches provide significant reduction in the volume of 
assays and a corresponding savings in reagent costs over conventional 
5 HTS. In addition, with further development in parallel processing in 
multiple chips, the number of assays performed in a given period of time 
can increase dramatically. 

In a specific embodiment, the HTS methods disclosed in the 
following literatures can be used, with or without modification, in the 

10 present methods for detecting, localizing and/or removing abnormal base- 
pairing, mutations and polymorphisms: Janzen et a!.. The 384-well plate: 
pros and cons, J. Biomol. Screening, 1:63-64 (1996); Lutz, et al., 
Experimental design for high-throughput screening, Drug Discov. Tech,, 
1:277-286 (1996); Klein, et a!., Recombinant microorganisms as tools for 

15 high throughput screening for non antibiotic compounds, J. Bion)oL 

Screening, 2:41-49 (1997); Webb, et al.. Transcription-specific assay for 
quantifying mRNA: A potential replacement for reporter gene assays, J, 
Biomol. Screening, 1:1 19-121 (1996); Charych, et al.. Direct colorimetric 
detection of receptor-ligand interaction by a polymerized bilayer assembly, 

20 Science, 261:585-588 (1993); Charych, et al., A 'litmus test' for 

molecular recognition using artificial membranes, Cfiem. Biol., 3:1 13-120 
(1996); Spevak, et al.. Carbohydrates in an acidic multivalent assembly: 
nanomolar P-selectin inhibitors, J. Med. Chem., 38:1018-1020 (1996); 
Allen, et al.. Atomic force microscopy in analytical biotechnology. Trends 

25 BiotechnoL, 15:101-105 (1997); Troy, et al.. Scanning force microscopy 
helps in the design of cancer drugs, Biophoton Int., 9/10 :52-53 (1996); 
Paborsky, et al., A nickel chelate microtiter plate assay for six histidine- 
containing proteins, Anal Biochem., 234 :60-65 (1996); Weiss-Wichert, 
et al., A new analytical device based on gated ion channels: A peptide 

30 channel biosensor, J. Biomol. Screening, 2:1 1-18 (1997); Brecht, et al.. 
Transducer-based approaches for parallel binding assays in HTS, J. 
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BiomoL Screening, 1:191-201 (1996); Tyagi, et al., Molecular beacons: 
probes that fluoresce upon hybridization, Nat BiotechnoL, 14:303-308 
(1996); Heller, et al., Discovery and analysis of inflammatory disease- 
related genes using cDNA microarrays, Proc. Natl. Acad. Sci USA, 
5 94:2150-2155 (1997); Nicolaou, et al., Radiofrequency encoded 

combinatorial chemistry, Angew Cham. Int Ed., 34:2i289-2291 (1995); 
Fitzgerald, et al.. Direct characterization of solid phase resin-bound 
molecules by. mass spectrometry, Bioorg. Med. Chem. Lett., 6:979-982 
(1996); Chu, et a!., Affinity capillary electrophoresis-mass spectrometry 

10 for screening combinatorial libraries, J. Am. Chem. Soc, 1 18 :7827"7835 
(1996); and Evans, et al., Affinity-based screening of combinatorial 
libraries using automated, serial-column chromatography, Nat. 
BiotechnoL, 14:504-507 (1 996). 
J. SAMPLE COLLECTION 

15 Any sample can be assayed for detecting, localizing and/or 

removing abnormal base-pairing, mutations or polymorphisms using the 
methods described in the above Sections B-F. In one embodiment, the 
sample being assayed is a biological sample from a mammal, particularly 
a human, such as a biological fluid or a biological tissue. Biological fluids, 

20 include, but are not limited to, urine, blood, plasma, serum, saliva, 

semen, stool, sputum, hair and other keratinous samples, cerebral spinal 
fluid, tears, mucus and amniotic fluid. Biological tissues contemplated 
include, but are not limited to, aggregates of cells, usually of a particular 
kind together with their intercellular substance that form one of the 

25 structural materials of a human, animal, plant, bacterial, fungal or viral 
structure, including connective, epithelium, muscle and nerve tissues, 
organs, tumors, lymph nodes, arteries and individual cell(s). In one 
specific embodiment, the body fluid to be assayed is urine. In another 
specific embodiment, the body fluid to be assayed is blood. Preferably, 

30 the blood sample Is further separated into a plasma or sera fraction. 

Serum or plasma can be recovered from the collected blood by any 

-249- 



wo 01/62968 PCTAJSOl/00452 

methods known in the art. In one specific embodiment, the serum or 
plasma is recovered. from the collected blood by centrifugation. 
Preferably, the centrifugation is conducted in the presence of a sealant 
having a specific gravity greater than that of the serum or plasma and 
5 less than that of the blood corpuscles which will form the lower, whereby 
upon centrifugation, the sealant forms a separator between the upper 
serum or plasma layer and the lower blood corpuscle layer. The sealants 
that can be used in the processes include, but are not limited to, styrene 
resin powders (Japanese Patent Publication No. 38841/1973), pellets or 

10 plates of a hydrogel of a crosslinked polymer of 2-hydroxy ethyl 
methacrylate or acrylamide (U.S. Patent No. 3,647,070), beads of 
polystyrene bearing an antithrombus agent or a wetting agent on the . 
surfaces (U.S. Patent No. 3,464,890) and a silicone fluid (U.S. Patent 
Nos. 3,852,T94 and 3,780,935). In a preferred embodiment, the sealant 

15 is a polymer of unsubstituted alkyi acrylates and/or unsubstituted alky! 
methacrylates, the alkyI moiety having not more than 18 carbon atoms, 
the polymer material having a specific gravity of about 1 .03 to 1 .08 and 
a viscosity of about 5,000 to 1,000,000 cps at a shearing speed of 
about 1 second*^ when measured at about 25^C (U.S. Patent No. 

20 4,140,631). 

In another specific embodiment, the serum or plasma is recovered 
from the collected blood by filtration. Preferably, the blood is filtered 
through a layer of glass fibers with an average diameter of about 0.2 to 5 
,// and a density of about 0.1 to 0.5 g./cm^ the total volume of the 

25 plasma or serum to be separated being at most about 50% of the 

absorption volume of the glass fiber layer; and collecting the run-through 
from the glass fiber layer which is plasma or serum (U.S. Patent No. 
4,477,575). Also preferably, the blood is filtered through a layer of glass 
fibers having an average diameter 0.5 to 2.5 // impregnated with a 

30 polyacrylic ester derivative and polyethylene glycol (U.S. Patent No. 
5,364,533). More preferably, the polyacrylic ester derivative is 
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poly{butyl acrylate), poly(mi3thyl acrylate) or poly(ethyl acrylate), and (a) 
poly{butyl acrylate), (b) polylmethyl acrylate) or poly(ethyl acrylate) and 
(c) polyethylene glycol are used in admixture at a ratio of (10-12):(1- 
4):(l-4). In still another specific embodiment, the serum or plasma is 
5 recovered from the collected blood by treating the blood with a coagulant 
containing a lignan skelton having oxygen-containing side chains or rings 
(U.S. Patent No. 4,803,153). Preferably, the coagulant contains a lignan 
skelton having oxygen-containing side chains or rings, e.g., d-sesamin, I- 
sesamin, paulownin, d-asarinin, l-asarinin, 2a-paulownin, 6a-paulownin, 

10 pinoresinol, d-eudesmin, l-pinoresinol B-D-glucoside, l-pinoresinol, I- 
pinoresinol monomethyl ether B-D-glucoside, epimagnolin, lirioresinol-B, 
syringaresinol (di), lirioresinonB-dlmethyl ether, phillyrin, magnolin, 
lirioresinol-A, 2a, 6a-d-sesamin, d-diaeudesmin, lirioresinol-C dimethyl 
ether (d-diayangambin) and sesamolin. More preferably, the coagulant Is 

1 5 used in an amount ranging from about 0.01 to 50 g per 1 I of the blood. 
K. COMBINATIONS, KITS AND ARTICLES OF MANUFACTURE 
Combinations, kits and articles of manufacture for detecting 
abnormal base-pairings, mutations, polymorphisms, and for localizing 
and/or removing abnormal base-pairings are provided herein. 

20 In a specific embodiment, a combination for detecting abnormal 

base-pairing in a nucleic acid duplex is provided herein, which 
combination comprises: a) a mutant DNA repair enzyme or complex 
thereof; and b) reagents for detecting binding between abnormal base- 
pairing in a nucleic acid duplex and the mutant DNA repair enzyme or 

25 complex thereof. A kit comprising the above combination is also 
provided. An article of manufacture is further provide herein, which 
article of manufacture comprises: a) packaging material; b) the above- 
described combination; and c) a label indicating that the article is for use 
in detecting abnormal base-pairing in a nucleic acid duplex. 

30 In another specific embodiment, a combination for detecting a 

mutation in a nucleic acid duplex is provided herein, which combination 
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comprises: a) a strand of a wild-type nucleic acid complementary to a 
nucleic acid having or suspected of having a mutation; b) a mutant DNA 
repair enzyme or complex thereof; and c) reagents for detecting binding 
between abnormal base-pairing in a nucleic acid duplex and the mutant 
5 DNA repair enzyme or complex thereof. A kit comprising the above 
combination is also provided. An article of manufacture is further 
provided, comprising: a) packaging material; b) the above combination; 
and c) a label indicating that the article is for use in detecting a mutation 
in a nucleic acid duplex. 

10 In still another specific embodiment, a combination for detecting a 

polymorphism in a locus is provided herein, which combination 
comprises: a) a complementary reference strand of a nucleic acid 
comprising a known allele of a locus; b) a mutant DNA repair enzyme or 
complex thereof; and c) reagents for detecting binding between abnormal 

15 base-pairing in a nucleic acid duplex and the mutant DNA repair enzyme 
or complex thereof. A kit comprising the above combination is also 
provided. An article of manufacture is further provided, comprising: a) 
packaging material; b) the above combination; and c) a label indicating 
that the article is for use in detecting a polymorphism in a locus. 

20 In yet another specific embodiment, a combination for removing a 

nucleic acid duplex containing one or more abnormal base-pairing in a 
population of nucleic acid duplexes is provided herein, which combination 
comprises: a) a mutant DNA repair enzyme or complex thereof; and b) 
reagents for removing a binding complex formed between a nucleic acid 

25 duplex containing one or more abnormal base-pairing and the mutant DNA 
repair enzyme or complex thereof. A kit comprising the above 
combination is also provided. An article of manufacture is further 
provided, comprising: a) packaging material; b) the above combination; 
and c) a label indicating that the article is for use in removing a nucleic 

30 acid duplex containing one or more abnormal base-pairing in a population 
of nucleic acid duplexes. 
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In yet another specific embodiment, a combination for detecting 
and localizing an abnormal base-pairing in a nucleic acid duplex is 
provided herein, which combination comprises: a) a mutant DNA repair 
enzyme or complex thereof; and b) an exonuclease. A kit comprising the 
5 above combination is also provided. An article of manufacture is further 
provided, comprising: a) packaging material; b) the above combination; 
and c) a label indicating that the article is for use in for detecting and 
localizing an abnormal base-pairing in a nucleic acid duplex. 

Since modifications will be apparent to those of skill in this art, it is 
1 0 intended that this invention be limited only by the scope of the appended 
claims. 
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CLAIMS: 

1 . A method for detecting abnormal base-pairing in a nucleic 
acid duplex, which method comprises: 

a) contacting a nucleic acid duplex having or suspected of 
5 having an abnormal base-pairing with a mutant nucleic acid repair enzyme 
or complex thereof, wherein the mutant nucleic acid repair enzyme or 
complex thereof has binding affinity for the abnormal base-pairing in the 
duplex but has attenuated catalytic activity compared to the wild-type 
enzyme; and 

1 0 b) detecting binding between the nucleic acid duplex and the 

mutant nucleic acid repair enzyme or complex thereof, whereby the 
presence or quantity of the abnormal base-pairing in the duplex is 
assessed. 

2. The method of claim 1, wherein the nucleic acid duplex is 
15 selected from the group consisting of a DNA:DNA, a DNArRNA and an 

RNA:RNA duplex. 

3. The method of claim 2, wherein the nucleic acid duplex is a 
DNArDNA duplex. 

4. The method of claim 1, wherein the abnormal base-pairing is 
20 selected from the group consisting of a base-pair mismatch, a base 

insertion, a base deletion and a pyrimidine dimer. 

5. The method of claim 4, wherein the base-pair mismatch is a 
single base-pair mismatch. 

6. The method of claim 1, wherein the mutant nucleic acid 
25 repair enzyme or enzyme complex is selected from the group consisting 

of a mutant mutH, a mutant mutL, a mutant mutM, a mutant mutS, a 
mutant mutY, a mutant uvrD, a mutant dam, a mutant thymidine DNA 
glycosylase (TDG), a mutant mismatch-specific DNA glycosylase (MUG), 
a mutant AlkA, a mutant MLHl, a mutant MSH2, a mutant MSH3, a 
30 mutant MSH6, a mutant Exonuclease I, a mutant T4 endonuclease V, a 
mutant FEN1 (RAD27), a mutant DNA polymerase 6, a mutant DNA 
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polymerase a mutant RPA, a mutant PCNA, a mutant RFC, a mutant 
Exonuclease V, a mutant DNA polymerase III holoenzyme, a mutant DNA 
helicase, a mutant RecJ- exonuclease and combinations thereof. 

7. The method of claim 1 , wherein the nucleic acid duplex is 
5 formed by hybridizing a single strands of nucleic acid that contain a 
known sequence with a nucleic acids from a test sample, whereby 
binding of the mutant enzyme to any duplexes indicates that presence of 
a sequence difference in the nucleic acid from the sample from that of 
the nucleic acid containing the known sequence. 
10 8. The method of claim 1, wherein the single strands of nucleic 

acid fragments with known sequences are immobilized on a solid support* 

9. The method of claim 8, wherein the fragments are arranged 
In an array. 

10. The method of claim 8 that is automated. 

15 1 1 . A method for detecting a mutation in a nucleic acid, 

comprising: 

a) hybridizing a strand of a nucleic acid having or suspected of 
having a mutation with a complementary strand of a nucleic acid 
fragment having a wild type sequence, whereby the mutation results in 

20 an abnormal base-pairing in the formed nucleic acid duplex; 

b) contacting the nucleic acid duplex formed in step a) with a 
mutant nucleic acid repair enzyme or complex thereof, wherein the 
mutant nucleic acid repair enzyme or complex thereof has binding affinity 
for the abnormal base-pairing in the duplex but has attenuated catalytic 

25 activity; and 

c) detecting binding between the nucleic acid duplex and the 
mutant nucleic acid repair enzyme or complex thereof, whereby the 
presence or quantity of the mutation is assessed. 

12. The method of claim 1 1 , wherein the nucleic acid strand to 
30 be tested and the complementary wild-type nucleic acid strand are NA 
strands. 
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T3. The method of claim 1 1 , wherein the mutation is associated 
with a disease or disorder, or infection by a pathological agent, and the 
method is used for prognosis or diagnosis of the presence or severity of 
the disease, disorder or infection. 
5 14. The method of claim 13, wherein the disease or disorder is 

selected from the group consisting of a cancer, an immune system 
disease or disorder, a metabolism disease or disorder, a muscle and bone 
disease or disorder, a nervous system disease or disorder, a signal disease 
or disorder and a transporter disease or disorder. 
- 10 15. . The method of claim 13, wherein the a plurality of mutations 

are identified by hybridizing nucleic acid single stands to a plurality of 
different fragments comprising loci encompassing different mutations. 

1 6. The method of claim 1 5 that js automated. 

17. A method for detecting polymorphism in a gene locus, 
1 5 comprising: 

a) hybridizing a target strand of a nucleic acid comprising a 
locus to be tested with a complementary reference strand of a nucleic 
acid comprising a known allele of the locus, whereby the allelic identity 
between the target and the reference strands results in the formation of a 

20 nucleic acid duplex without an abnormal base-pairing and the allelic 
difference between the target and the reference strands results in the 
formation of a nucleic acid duplex with an abnormal base-pairing; 

b) contacting the nucleic acid duplex formed in step a) with a 
mutant nucleic acid repair enzyme or complex thereof, wherein the 

25 mutant nucleic acid repair enzyme or complex thereof has binding affinity 
for the abnormal base-pairing in the duplex but has attenuated catalytic 
activity; and 

c) detecting binding between the nucleic acid duplex and the 
mutant nucleic acid repair enzyme or complex thereof, whereby the 

30 polymorphism in the locus is assessed. 

1 8. The method of claim 1 7, wherein a plurality of reference 
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Strands are hybridized. 

19. The method of claim 18, wherein the reference strands are 
immobilized on a solid support. 
5 20. The method of claim 19, wherein the reference strands are 

immobilized in an array. 

21. The method of claim 17, wherein the polymorphism to be 
detected is a variable nucleotide type polymorphism ("VNTR"). 

22, The method of claim 17, wherein the polymorphism to be 
10 detected is a single nucleotide polymorphism (SNP). 

23. The method of claim 22, wherein the SNP is a human 
genome SNP. 

24. The method of claim 23, wherein the hybridization between 
the target strand of a nucleic acid comprising a locus to be tested and the 

15 complementary reference strand of a nucleic acid comprising a known 
allele of the locus is facilitated by a reconribinase. 

25, The method of claim 18 that is automated. 

26, A method for purifying or separating nucleic acid duplex 
containing one or more abnormal base-pairing from a population of 

20 nucleic acid duplexes, which method comprises: 

a) contacting a population of nucleic acid duplexes having or 
suspected of having a nucleic acid duplex containing one or more 
abnormal base-pairing with a mutant nucleic acid repair enzyme or 
complex thereof, wherein the mutant nucleic acid repair enzyme or 

25 complex thereof has binding affinity for the abnormal base-pairing in the. 
duplex but has attenuated catalytic activity and whereby the nucleic acid 
duplex containing one or more abnormal base-pairing binds to the mutant 
nucleic acid repair enzyme or complex thereof to form a binding complex; 
and 

30 b) removing nucleic acid duplexes that contain the binding 

complex formed in step a) from the population of nucleic acid duplexes. 
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27. The method of claim 1, wherein the abnormal base-pairing is 
selected from the group consisting of a base-pair mismatch, a base 
insertion, a base deletion and a pyrimidine dimer. 

28. The method of claim 11, wherein the abnormal base-pairing 
5 is selected from the group consisting of a base-pair mismatch, a base 

insertion, a base deletion and a pyrimidine dimer. 

29. The .method of claim 26, wherein the abnormal base-pairing 
is selected from the group consisting of a base-pair mismatch, a base 
insertion, a base deletion and a pyrimidine dimer. 

10 30, The method of claim 26, wherein the population of nucleic 

acid duplexes is produced by an enzymatic amplification. 

,31. A method for detecting and localizing an abnormal base- 
pairing in a nucleic acid duplex, which method comprises: 

a) contacting a nucleic acid duplex having or suspected of 

1 5 having an abnormal base-pairing with a mutant nucleic acid repair enzyme 
or complex thereof, wherein the mutant nucleic acid repair enzyme or 
complex thereof has binding affinity for the abnormal base-pairing in the 
duplex but has attenuated catalytic activity and whereby the nucleic acid 
duplex containing an abnormal base-pairing binds to the mutant nucleic 

20 acid repair enzyme or complex thereof to form a binding complex; 

b) subjecting the nucleic acid duplex to hydrolysis with an 
exonuclease under conditions such that the binding complex formed in 
step a) blocks hydrolysis; and 

c) determining the location within the nucleic acid duplex 
25 protected from the hydrolysis, thereby detecting and localizing the 

abnormal base-pairing in the nucleic acid duplex. 

32. The method of claim 31, wherein the nucleic acid duplex is 
selected from the group consisting of a DNA:DNA, a DNA:RNA and a 
RNA:RNA duplex. 

30 33. The method of claim 31., wherein the abnormal base-pairing 

is selected from the group consisting of a base-pair mismatch, a base 
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insertion, a base deletion and a pyrimidine dimer. 

34. The method of claim 31, wherein the exonuclease is 
selected from the group consisting of nuclease BAL-31, exonuclease III, 
Mung Bean exonuclease and Lambda exonuclease. 
5 35. The method of claim 1, wherein the mutant nucleic acid 

repair enzyme or complex thereof is labelled with a detectable label, 

36. The method of claim 35, wherein the mutant nucleic acid 
repair enzyme or complex thereof is labelled with biotin. 

37. The method of claim 36, wherein the binding between the 
10 abnormal base-pairing and the biotin-labelled mutant nucleic acid repair 

enzyme or complex thereof is detected with a streptavidin labeled 
enzyme. 

38. The method of claim 37, wherein the streptavidin labeled 
enzyme is selected from the group consisting of a peroxidase, an urease, 

1 5 an alkaline phosphatase, a luciferase and a glutathione S-transferase. 

39. The method of claim 31 , wherein the mutant nucleic acid 
repair enzyme or complex thereof is labelled. 

40. The method of claim 1 1 , wherein the mutant nucleic acid 
repair enzyme or complex thereof is labelled with a detectable label. 

20 41 . The method of claim 17, wherein the mutant nucleic acid 

repair enzyme or complex thereof is labelled with a detectable label. 

42. The method of claim 26, wherein the mutant nucleic acid 
repair enzyme or complex thereof is labelled with a detectable label. 

43. The method of claim 1, wherein the nucleic acid duplex or 
25 the mutant nucleic acid repair enzyme or complex thereof is immobilized 

on the surface of a support. 

44. The method of claim 43, wherein the nucleic acid duplex or 
the mutant nucleic acid repair enzyme or complex thereof is Immobilized 
directly on the surface or is immobilized on the surface via a linker. 

30 45. The method of claim 43, wherein the insoluble support is a 

silicon chip. 
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46. The method, of claim 45, wherein the geometry of the 
support is selected from the group consisting of beads, pellets, disks, 
capillaries, hollow fibers, needles, solid fibers, random shapes, thin films, 
rhembranes and chips. 
5 47. The method of claim 44, wherein the nucleic acid duplex or 

the mutant nucleic acid repair enzyme or complex thereof is immobilized 
in an array or a well format on the surface. 

48. The method of claim 1 1 , wherein the strand of a nucleic acid 
having or suspected of having a mutation, the complementary strand of a 

10 wild-type nucleic acid, or the mutant nucleic acid repair enzyme or 
complex thereof is immobilized on the surface of a support. 

49. The method of claim 17, wherein the target strand of a 
nucleic acid comprising a locus to be tested, the complerhentary 
reference strand of a nucleic acid comprising a known allele of the locus , 

1 5 or the mutant nucleic acid repair enzyme or complex thereof is 
immobilized on the surface of a support. 

50. The method of claim '26, wherein the mutant nucleic acid 
repair enzyme or complex thereof is immobilized on the surface of a 
support. 

20 51 . The method of claim 31, wherein the nucleic acid duplex 

having or suspected of having an abnormal base-pairing or the mutant 
nucleic acid repair enzyme or complex thereof is immobilized on the 
surface of a support. 

52. The method of claim 1, wherein the nucleic acid duplex 

25 having or suspected of having an abnormal base-pairing is isolated from a 
sample. 

53. The method of claim 52, wherein the sample is a body fluid 
or a biological tissue. 

54. The method of claim 53, wherein the body fluid is selected 
30 from the group consisting of urine, blood, plasma, serum, saliva, semen, 

stool, sputum, cerebral spinal fluid, tears, mucus and amniotic fluid. 
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55. The method of claim 53, wherein the biological tissue is 
selected from the group consisting of connective tissue, epithelium 
tissue, muscle tissue, nerve tissue, organs, tumors, lymph nodes, arteries 
and individual cell(s). 
5 56. The method of claim 11, wherein the strand of a nucleic acid 

having or suspected of having a mutation is isolated from a sample. 

57. The method of claim 17, wherein the strand of a nucleic acid 
comprising a locus to be tested is isolated from a sample, 

58. The method of claim 26, wherein the population of nucleic 
10 acid duplexes is isolated from a sample. 

59. The method of claim 31 , wherein the nucleic acid duplex 
having or suspected of having an abnormal base-pairing is isolated from a 
sample. 

60. The method of claim 1 , wherein abnormal base-pairings in a 
1 5 plurality of the nucleic acid duplexes are detected simultaneously. 

61 . The method of claim 1 1 , wherein mutations in a plurality of 
the nucleic acids are detected simultaneously. 

62. The method of claim 17, wherein polymorphisms in a 
plurality of the loci are detected simultaneously. 

20 63. The method of claim 26, wherein a plurality of nucleic acid 

duplexes containing one or more abnormal base-pairing are removed 
simultaneously. 

64. The method of claim 31, wherein a plurality of the abnormal 
base-pairings are detected and localized simultaneously. 
25 65. A combination for detecting abnormal base-pairing in a 

nucleic acid duplex, which combination comprises: 

a) a mutant nucleic acid repair enzyme or complex thereof; and 

b) a reagent for detecting binding between abnormal base- 
pairing in a nucleic acid duplex and the mutant nucleic acid repair enzyme 

30 or complex thereof. 

66. A kit comprising the cornbination of claim 65 and 

-261- 
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instructions for binding the mutant repair enzyme to nucleic acid duplexes 
to detect a mutation in a nucleic acid duplex, or to detect a polymorphism 
in a locus, or diagnose a disease or disorder or plurality thereof, or for 
gene mapping or identification by detecting a plurality of polymorphisms 
5 or mutations. 

67. An isolated substantially pure mutant nucleic acid repair 
enzyme that further comprises a detectable label, wherein the mutant 
enzyme has attenuated catalytic activity compared to the wild type but 
retains binding affinity for a nucleic acid duplex containing an abnormal 

10 base pairing. 

68. The mutant enzyme of claim 67 that comprises a fusion 
protein or conjugate of the mutant enzyme and an enzyme label. 

69. An isolated substantially pure biotinylated mutant nucleic 
acid repair enzyme. 

15 70. An article of manufacture, comprising: 

a) packaging material; 

b) a mutant nucleic acid repair enzyme that has attenuated 
catalytic activity compared to the wild type but retains binding affinity for 
a nucleic acid duplex containing an abnormal base pairing; and 

20 c) a label indicating that the article is for use in detecting 

abnormal base-pairing in a nucleic acid duplex. 

71 . A combination for detecting and localizing an abnormal base- 
pairing in a nucleic acid duplex, comprising 

a) a mutant nucleic acid repair enzyme or complex thereof, 
25 wherein tge mutant enzyme that has attenuated catalytic activity 

compared to the wild type but retains binding affinity for a nucleic acid 
duplex containing an abnormal base pairing; and 

b) an exonuclease. 

72. A kit, comprising the combination of claim 71 and 
30 instructions for performing an assay for detecting and localizing an 

abnormal base-pairing an a nucleic acid duplex. 

-262- 
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SEQUENCE LISTING 



<110> 



General Atomics 



<120> 



MUTANT NUCLEIC BINDING ENZYMES AND USE THEREOF IN DIAGNOSTIC, 
DETECTION AND PURIFICATION METHODS 



<130> 



46699-20004 .40 



<140> 



Unas signed 



<141> 



Herewith 



<150> 



09/514,016 



<151> 2000-02-25 
<150> 09/457,205 
<151> 1999-12-06 
<150> 09/347,878 
<151> 1999-07-06 
<160> 78 

<170> PatentIn Ver. 2.0 



<210> 1 
<211> 7 
<212> PRT 

<213> Saccharomyces cerevisiae 
<220> 

<223> Peptide fragment of fusion protein comprising 
epitope tag: FLAG 

•<300> 

<301> Prickett et al., 
<303> BioTechniques 
<304> 7 
<305> 6 
<306> 580-584 
<307> 1989 

<400> 1 

Asp Tyr Lys Asp Asp Asp Lys 
1 5 



<210> 2 
<211> 9 
<212> PRT 

<213> Haemophilus influenzae 

<220> 

<223> Peptide fragment of fusion protein comprising 
epitope tag: HA 

<300> 

<301> Xie et al. , 
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<303> Endocrinology 

<304> 139 

<305> 11 

<306> 4563-4567 

<307> 1998 

<400> 2 

Tyr Pro Tyr Asp Val Pro Asp Tyr Ala 
1 5 

<210> 3 
<211> 11 
<212> PRT 

<213> Artificial Sequence 

<220> 

<223> Peptide fragment of fusion protein comprising 
epitope tag: HA 1 

<220> 

<223> Description of Artificial Sequence: Synthetic 

peptide corresponding to a sequence on an exposed 
loop region the X31 influenza HAl protein 

<300> 

<301> Nagelkerken et al., . 

<303> Electrophoresis 

<304> 18 

<306> 2694-2698 

<307> 1997 

<400> 3 

Cys Gin Asp Leu Pro Gly Asn Asp Asn Ser Thr 
1 - 5 10 



<210> 4 
<211> 10 
<212> PRT 

<213> Homo sapiens 
<22q> 

<223> Peptide fragment of fusion protein comprising 
epitope tag: c-Myc 

<220> 
<300> 

<301> Xie et al . , 

<3 03> Endocrinology 

<304> 139 

<305> 11 

<306> 4563-4567 

<307> 1998 

<400> 4 

Glu Gin Lys Leu lie Ser Glu Glu Asp Leu 
l" 5 10 



<210> 5 
<211> 6 
<212> PRT 

<213> Artificial Sequence 



<220> 
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<223> Peptide fragment of fusion protein comprising 
epitope tag: 6 -His 

<220> 

<223> Description of Artificial Sequence: Histidine repeat epitope 
tag 

<400> 5 

His His His His His His 
1 5 



<210> 6 
<211> 5 
<212> PRT 

<213> Bovine papillomavirus 

<220> 

<223> Peptide fragment of fusion protein comprising 
epitope tag: AUl 

<400> 6 

Asp Thr Tyr Arg He 
1 5 



<210> 7 

<211> 6 

<212> PRT 

<213> Homo sapiens 

<220> 

<223> Peptide fragment of fusion protein comprising 
epitope tag: EE 

<300> 

<.301> Tolbert and Lameh, 
<303> J. Neurochem. 
<304> 70 
<30'6> 113-119 
<307> 1998 

<400> 7 

Glu Tyr Met Pro Met Glu 
1 5 



<210> 8 
<211> 11 
<212> PRT 

<213> Artificial Sequence 

<220> 

<223> Peptide sequence of fusion protein comprising 
epitope tag: T7 

<220> 

<223> Description of Artificial Sequence: T-7 Tag 
Sequence 

<300> 

<3 01> Chen and Katz 
<3 03> BioTechniques 
<304> 25 
<305> 1 
<306> 22-24 
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<307> 1998 
<300> 

<301> Tseng and Verma, 
<303> Gene 
<304> 169 
<306> 287-288 
<307> 1996 

<400> 8 

Ala Ser Met Thr Gly Gly Gin Gin Met Gly Arg 
15 10 



<210> 9 
<211> 10 
<212> PRT 

<213> Mus musculus 

<220> 

<223> Peptide fragment of fusion protein comprising 
epitope tag: 4A6 

<300> 

<3 01> Rudiger et al., 
<3 03> BioTechniques 
<304> 23 
<305> 1 
<306> 96-97 
<307> 1997 

<400> 9 

Ser Phe Pro Gin Phe Lys Pro Gin Glu He 
15 10 



<210> 10 
<211> 12 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Peptide fragment of fusion protein comprising 
epitope tag: 6 

<220> 

<223> Description of Artificial Sequence: peptide 
derived from last 12 amino acids of protein 
kinase C epsilon gene 



<300> 




<301> 


Olah et al . , 


<303> 


Biochemistry 


<304> 


221 


<306> 


94-102 


<307> 


1994 


<400> 


10 



Lys Gly Phe Ser Tyr Phe Gly Glu Asp Leu Met Pro 
15 10 



<210> 11 
<211> 6 
<212> PRT 

<213> Blue tongue virus 
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<220> 

<223> Peptide fragment of fusion protein comprising 
eptitope tag: B 

<300> 

<301> Wang et al . 
<303> Gene 
<304> 169 
<305> 1 
<306> 53-58 
<307> 1996 

<400> 11 

Gin Tyr Pro Ala Leu Thr 
1 5 • 



<210> 12 
<211> 11 
<212> PRT 

<213> Varicella Zoster Virus 

<220> 

<223> Peptide sequence of fusion protein comprising 
eptiope tag: gE 

<300> 

<310> 5,710,248 
<311> 1996-07-29 
<312> 1998-01-20 

<400> 12 

Gin Arg Gin Tyr Gly Asp Val Phe Lys Gly Asp 
15 10 



<210> 13 
<211> 10 
<212> PRT 

<213> Saccharomyces cerevisiae 
<220> 

<223> Peptide fragment of fusion protein comprising 
epitope tag: Tyl 

<300> 

<301> Bastin et al . 

<303> Mol. iBiochem. Parasitol. 

<304> 77 

<306> 235-239 

<307> 1996 

<400> 13 

Glu Val His Thr Asn Gin Asp Pro Leu Asp 
1 5 • . 10 



<210> 14 
<211> 890 
<212> PRT 

<213> Homo sapiens 
<220> 

<223> T lynphocyte DNA binding protein: NF-At. sub.p 
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<300> 

<310> 5,656,452 
<311> 1993-10-29 
<312> 1997-08-12 

<400> 14 

Gly Ser Ser Ala Ser Phe lie Ser Asp Thr Phe Ser Pro Tyr Thr Ser 
15 10 15 

Pro Cys" Val Ser Pro Asn Asn Ala Gly Pro Asp Asp Leu Cys Pro Gin 

20 25 30 

Phe Gin Asn lie Pro Ala His Tyr Ser Pro Arg Thr Ser Pro lie Met 
35 40 45 

Ser Pro Arg Thr Ser Leu Ala'Glu Asp Ser Cys Leu Gly Arg His Ser 
50 55 60 

Pro Val Pro Arg Pro Ala Ser Arg Ser Ser Ser Pro Gly Ala Lys Arg 
65 70 75 80 

Arg His Ser Cys Ala Glu Ala Leu Val Ala Pro Leu Pro Ala Ala Ser 

85 90 95 

Pro Gin Arg Ser Arg Ser Pro Ser Pro Gin Pro Ser Pro His Val Ala 
100 105 110 

Pro Gin Asp Asp Ser lie Pro Ala Gly Tyr Pro Pro Thr Ala Gly Ser 
115 120 125 

Ala Val Leu Met Asp Ala Leu Asn Thr Leu Ala Thr Asp Ser Pro Cys 
130 135 140 

Gly He Pro Ser Lys He Trp Lys Thr Ser Pro Asp Pro Thr Pro Val 
145 150 ' 155 160 

Ser Thr Ala Pro Ser Lys Ala Gly Leu Ala Arg His He Tyr Pro Thr 
165 - 170 175 

Val Glu Phe Leu Gly Pro Cys Glu Gin Glu Glu Arg Arg Asn Ser Ala 
180 185 190 

•Pro Glu Ser He Leu Leu Val Pro Pro Thr Trp Pro Lys Gin Leu Val 
195 200 205 

Pro Ala He Pro He Cys Ser He Pro Val Thr Ala Ser Leu Pro Pro 
210 ' 215 220 

Leu Glu Trp Pro Leu Ser Asn Gin Ser Gly Ser Tyr Glu Leu Arg He 
225 230 235 240 

Glu Val Gin Pro Lys Pro His His Arg Ala His Tyr Glu Thr Glu Gly 
245 250 255 

Ser Arg Gly Ala Val Lys Ala Pro Thr Gly Gly His Pro Val Val Gin 
260 265 270 

Leu His Gly Tyr Met Glu Asn Lys Pro Leu Gly Leu Gin He Phe He 
275 280 285 

Gly Thr Ala Asp Glu Arg He Leu Lys Pro His Ala Phe Tyr Gin Val 

290 295 ' 300 

His Arg He Thr Gly Lys Thr Val Thr Thr Thr Ser Tyr Glu Lys He 
305 310 315 320 
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Val Gly Asn Thr Lys Val Leu Glu lie Pro Leu Glu Pro Lys Asn Asn 
325 330 335 

Met Arg Ala Thr He Asp Cys Ala Gly He Leu Lys Leu Arg Asn Ala 
340 345 350 

Asp He Glu Leu Arg Lys Gly Glu Thr Asp He Gly Arg Lys Asn Thr 
355 360 365 

Arg Val Arg Leu Val Phe Arg Val His Val Pro Glu Pro Ser Gly Arg 
370 375 380 

He Val Ser Leu Gin Ala Ala Ser Asn Pro He Glu Cys Ser Gin Arg 
385 390 395 400 

Ser Ala His Glu Leu Pro Met Val Glu Arg Gin Asp Met Asp Ser Cys 
405 410 415 

Leu Val Tyr Gly Gly Gin Gin Met He Leu Thr Gly Gin Asn Phe Thr 
420 425 430 

Ala Glu Ser Lys Val Val Phe Met Glu Lys Thr Thr Asp Gly Gin Gin 

435 440 445 

He Trp Glu Met Glu Ala Thr Val Asp Lys Asp Lys Ser Gin Pro Asn 
450 455 460 

Met Leu Phe Val Glu He Pro Glu Tyr Arg Asn Lys His He Arg Val 
465 470 475 480 

Pro Val Lys Val Asn Phe Tyr Val He Asn Gly Lys Arg Lys Arg Ser 
485 490 495 

Gin Pro Gin His Phe Thr Tyr His Pro Val Pro Ala He Lys Thr Glu 
500 505 510 

Pro Ser Asp Glu Tyr Glu Pro Ser Leu He Cys Ser Pro Ala His Gly 
515 . 520 525 

Gly Leu Gly Ser Gin Pro Tyr Tyr Pro Gin His Pro Met Leu Ala Glu 
530 535 540 

Ser Pro Ser Cys Leu Val Ala Thr Met Ala Pro Cys Gin Gin Phe Arg 
545 550 555 560 

Ser Gly Leu Ser Ser Pro Asp Ala Arg Tyr Gin Gin Gin Ser Pro Ala 
565 570 575 

Ala Ala Leu Tyr Gin Arg Ser Lys Ser Leu Ser Pro Gly Leu Leu Gly 
580 585 590 

Tyr Gin Gin Pro Ser Leu Leu Ala Ala Pro Leu Gly Leu Ala Asp Ala 
595 600 605 

His Arg Ser Val Leu Val His Ala Gly Ser Gin Gly Gin Gly Gin Gly 
610 615 620 

Ser Thr Leu Arg His Thr Ser Ser Ala Ser Gin Gin Ala Ser Pro Val 
625 630 635 640 

He His Tyr Ser Pro Thr Asn Gin Gin Leu Arg Gly Gly Gly His Gin 
645 650 655 



Glu Phe Gin His He Met Tyr Cys Glu Asn Phe Gly Pro Ser Ser Ala 
660 665 670 
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Arg Pro Gly Pro Pro Pro lie Asn Gin Gly Gin Arg Leu Ser Pro Gly 
675, 680 685 

Ala Tyr Pro Thr Val lie Gin' Gin Gin Thr Ala Pro Ser Gin Arg Ala 
690 695 700 

Ala Lys Asn Gly Pro Ser Asp Gin Lys Glu Ala Leu Pro Thr Gly Val 
705 710 715 720 

Thr Val Lys Gin Glu Gin Asn Leu Asp Gin Thr Tyr Leu Asp Asp Ala 
725 730 735 

Ala Thr Ser Glu Ser Trp Val Gly Thr Glu Arg Tyr lie Glu Arg Lys 
740 745 750 

Phe Trp Lys Lys Thr Leu Val Gin Pro Gly Leu Leu Pro Ser Phe Leu 
755 760 765 

Leu Leu Gly Ser Leu Ser Ala Gly Pro Arg Ser Gin Thr Pro Ser Glu 
770 775 780 

Arg Lys Pro lie Glu Glu Asp Val Pro Leu Ser Cys Ser Gin lie Ala 
785 790 795 800 

Trp Cys Cys Gin His Pro Leu Gly Thr Cys Pro Val Leu Pro Gly Pro 
805 810 815 

Leu Ala Val Glu Trp Trp Glu Gly Gin Leu Gly Arg Gly Leu Glu Pro 

820 825 830 

lie Pro Trp Ala Pro Asp Ser Ala Gly Ser Leu His Glu Val Asp Ser 
835 840 845 

Val Gly Leu Ala Gly Val Val Gly Met Val Leu Leu Thr Leu Met His 
850 855 860 

His Phe Ser Met Asp Gin Asn Gin Thr Pro Ser Pro His Trp Gin Arg 
865 870 875 880 

His Lys Glu Val Ala Ser Pro Gly Trp He 
885 890 

<210> 15 

<211> 29 

<212> DNA 

<213> Muridae sp. 

<220> 

<223> DNA sequence binding pair: NF-AT site 
<300> 

<310> 5,655,452 
<311> 1993-10-29 
<312> 1997-08-12 

<400> 15 

gcccaaagag gaaaatttgt ttcatacag 29 

<210> 16 
<211> 160 
<212> PRT 

<213> Homo sapiens 



<220> 
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<223> Human helix- loop-helix zipper protein DNA binding 
sequence: MAX 

<300> 

<310> 5,693,487 
<311> 1994-04-01 
<312> 1997-12-02 

<400> 16 

Met Ser Asp Asn Asp Asp lie Glu Val Glu Ser Asp Glu Glu Gin Pro 
1 5 * 10 15 

Arg Phe Gin Ser Ala Ala Asp Lys Arg Ala His His Asn Ala Leu Glu 
20 25 30 

Arg Lys Arg Arg Asp His lie Lys Asp Ser Phe His Ser Leu T^g I\sp 
35 ■ 40 45 

Ser Val Pro Ser Leu Gin Gly Glu Lys Ala Ser Arg Ala Gin lie Leu 
50 55 60 

Asp Lys Ala Thr Glu Tyr He Gin Tyr Met Arg Arg Lys Asn His Thr 
65 70 75 80 

His Gin Gin Asp He Asp Asp Leu Lys Arg Gin Asn Ala Leu Leu Glu 
as 90 95 

Gin Gin Val Arg Ala Leu Glu. Lys Ala Arg Ser Ser Ala Gin Leu Gin 

100 105 110 

Thr Asn Tyr Pro- Ser Ser Asp Asn Ser Leu Tyr Thr Asn Ala Lys Gly 
115 120 125 

Ser Tha; He Ser Ala Phe Asp Gly Gly Ser Asp Ser Ser Ser Glu Ser 
130 135 140 

Glu Pro Glu Glu Pro Gin Ser Arg Lys Lys Leu Arg Met Glu Ala Ser 
145 isO 155 160 



<210> 17 
<211> 533 
<212> PRT 

<213> Homo sapiens 
<220> 

<223> Human Egr Dna binding protein: zinc finger domain 
<300> 

<310> 5,866,325 
<311> 1995-06-06 
<312> 1999^02-02 

<400> 17 

Met Ala Ala Ala Lys Ala Glu Met Gin Leu Met Ser Pro Leu Gin He 
15 10 15 

Ser Asp Pro Phe Gly Ser Phe Pro His Ser Pro Thr Met Asp Asn Tyr 
20 25 30 



Pro Lys Leu Glu Glu Met Met Leu Leu Ser Asn Gly Ala Pro Gin Phe 
35 40 45 
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Leu Gly Ala Ala Gly Thr Pro Glu Gly Ser Gly Gly Asn Ser Ser Ser 
50 55 60 

•Ser Thr Ser Ser Gly Gly Gly Gly Gly Gly Gly Ser Asn Ser Gly Ser 
65 70 75 80 

Ser Ala Phe Asn Pro Gin Gly Glu Pro Ser Glu Gin Pro Tyr Glu His 
85 90 95 

Leu Thr Thr Glu Ser Phe Ser Asp lie Ala Leu Asn Asn Glu Lys Ala 
100 105 110 

Met Val Glu Thr Ser Tyr Pro Ser Gin Thr Thr Arg Leu Pro Pro lie 
115 120 125 

Thr Tyr Thr Gly Arg Phe Ser" Leu Glu Pro Ala Pro Asn Ser Gly Asn 

130 135 140 

Thr Leu Trp Pro Glu Pro Leu Phe Ser Leu Val Ser Gly Leu Val Ser 
145 150 155 160 

Met Thr Asn Pro Pro Thr Ser Ser Ser Ser Ala Pro Ser Pro Ala Ala 

165 170 175 

Ser Ser Ser Ser Ser Ala Ser Gin Ser Pro Pro Leu Ser Cys Ala Val 
180 185 190 

Pro Ser Asn Asp Ser Ser Pro He Tyr Ser Ala Ala Pro Thr Phe Pro 
195 200 205 

Thr Pro Asn Thr Asp He Phe Pro Glu Pro Gin Ser Gin Ala Phe Pro 
210 215 220 

Gly Ser Ala Gly Thr Ala Leu Gin Tyr Pro Pro Pro Ala Tyr Pro Ala 
225 230 235 240 

Thr Lys Gly Gly Phe Gin Val Pro Met He Pro Asp Tyr Leu Phe Pro 
245 250 255 

Gin Gin Gin Gly Asp Leu Ser Leu Gly Thr Pro Asp Gin Lys Pro Phe 
260 265 270 

Gin Gly Leu Glu Asn Arg Thr Gin Gin Pro Ser Leu Thr Pro Leu Ser 
275 280 285 

Thr He Lys. Ala Phe Ala Thr Gin Ser Gly Ser Gin Asp Leu Lys Ala 
290 295 300 

Leu Asn Thr Thr Tyr Gin Ser Gin Leu He Lys Pro Ser Arg Met Arg 
305 310 315 320 

Lys Tyr Pro Asn Arg Pro Ser Lys Thr Pro Pro His Glu Arg Pro Tyr 
325 330 335 

Ala Cys Pro Val Glu Ser Cys Asp Arg Arg Phe Ser Arg Ser Asp Glu 
340 345 350 

Leu Thr Arg His He Arg He His Thr Gly Gin Lys Pro Phe Gin Cys 
355 360 365 

Arg He Cys Met Arg Asn Phe Ser Arg Ser Asp His Leu Thr Thr His 
370 375 380 



He Arg Thr His Thr Gly Glu Lys Pro Phe Ala Cys Asp He Cys Gly 
385 390 395 400 
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Arg Lys Phe Ala Arg Ser Asp Glu Arg Lys Arg His Thr Lys lie His 

405 410 415 

Leu Arg Gin Lys Asp Lys Lys Ala Asp Lys Ser Val Val Ala Ser Pro 
420 425 430 

Ala Ala Ser Ser Leu Ser Ser Tyr Pro Ser Pro Val Ala Thr Ser Tyr 
435 440 445 

Pro Ser Pro Ala Thr Thr Ser Phe Pro Ser Pro Val Pro Thr Ser Tyr 
450 455 460 

Ser Ser Pro Gly Ser Ser Thr Tyr Pro Ser Pro Ala His Ser Gly Phe 
465 470 475 480 

Pro Ser Pro Ser Val Ala Thr Thr Phe Ala Ser Val Pro Pro Ala Phe 
485 490 495 

Pro Thr Gin Val Ser Ser Phe Pro Ser Ala Gly Val Ser Ser Ser Phe 
500 505 510 

Ser Thr Ser Thr Gly Leu Ser Asp Met Thr Ala Thr Phe Ser Pro Arg 
515 520 525 

Thr lie Glu He Cys 
530 

<210> 18 
<211> 224 
<212> PRT 

<213> Homo sapiens 
<220> 

<223> Human S 1-3 DNA binding protein: zinc finger 
domain 

<300> 

<310> 5,905,146 
<311> 1996-03-15 
<312> 1999-05-18 

•<400> 18 

Pro He Glu Val Cys Arg Ser Lys Leu Ser Lys Tyr Leu Gin Gly Val 

15 10 15 ^ 

Val Phe Arg Cys Asp Lys Cys Thr Phe Thr Cys Ser Ser Asp Glu Ser 
20 25 30 

Leu Gin Gin His He Glu Lys His Asn Glu Leu Lys Pro Tyr Lys Cys 
35 40 45 

Gin Leu Cys Tyr Tyr Glu Thr Lys His Thr Glu Glu Leu Asp Ser His 
50 55 60 

Leu Arg Asn Glu His Lys Val Ser Arg Asn Phe Glu Leu Val Gly Arg 
65 70 75 80 

Val Asn Leu Asp Gin Leu Glu Gin Met Lys Glu Lys Met Glu Ser Ser 
85 90 95 

Ser Ser Asp Asp Glu Asp Lys Glu Glu Glu Met Asn Ser Lys Ala Glu 
100 105 110 

Asp Arg Glu Leu Met Arg Phe Ser Asp His Gly Ala Ala Leu Asn Thr 
115 120 125 
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Glu Lys Arg Phe Pro Cys Glu Phe Cys' Gly Arg Ala Phe Ser Gin Ala 
130 135 140 

Ser Glu Trp Glu Arg His Val. Leu Arg His Gly Met Ala Leu Asn Asp 
145 150 155 160 

Thr Lys Gin Val Ser Arg Glu Glu lie His Pro Lys Glu lie Met Glu 
165 170 175 

Asn Ser Val Lys Met Pro Ser lie Glu Glu Lys Glu Asp Asp Glu Ala 
180 185 190 

lie Gly lie Asp Phe Ser Leu Lys Asn Glu Thr Val Ala lie Cys' Val 
195 200 205 

Val Thr Ala Asp Lys Ser Leu Leu Glu Asn Ala Glu Ala Lys Lys Glu 
210 215 .220 

<210> 19 
<211> 272 
<212>- RNA 
<213> Rattus sp. 

<220> 

<223> Untranslated region (UTR) of rat glucose 
transporter mRNA (GLUTl) 

<300> 

<310> 5,859,227 
<311> 1997-02-20 
<312> 1999-01-12 

<400> 19 

gg99cgggcc aauggcggcg guccuauaaa aaggcagcuc cgcgcgcucu cuuccuaaga 60 
acacaagaau cccuugugga gugucgguuu agguugcagg gucuuaagug agucagggcg 120 
cggagguccg gcgggagacg cauagucaca gaacguccau ucuccguuuc acagcccgca 180 
cagcuugagc cucgagcgca gcgcggccau ggagcccagc agcaagaagg ugacgggccg 24 0 
ccuuauguug gccgugggag gggcagugcu eg 272 



<210> 20 

<211> 108 

<212> RNA 

<213> Homo sapiens 

<220> 

<223> 5' untranslated region (UTR) of human 

3 -hydroxy- 3 -methyl -3 -glut aryl CoA reductase 
{HMG,CoA Red) 

<300> . 

<310> 5,859,227 
<311> 1997-02-20 
<312> 1999-01-12 

<400> 20 

uccuuccgcu ccgcgacugc guuaacugga, gccaggcuca gcgucggcgc cgggguucgg 60 



uggccucuag ugagaucugg aggauccaag gauucuguag cuacaaug 



108 
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<210> 21 

<211> 107 

<212> RNA 

<213> Homo sapiens 

<220> 

<223> 5' untranslated region (UTR) of human C4b-binding 
protein- alpha chain 

<300> 

<310> 5,859,227 
<311> 1997-02-20 
<312> 1999-01-12 

<400> 21 

gcugcuuuau uucugcuguu aaucauucau ugggcccguc aaaaguuucu gcccaucuau 60 
uuccaucaac cguccuugac cagccaacca cauggcugaa auucagg 107 



<210> 22 

<211> 174 

<212> RNA 

<213> Homo sapiens 

<220> 

<223> 5' untranslated region (UTR) of human CD45 
<300> 

<310> 5,859,227 
<311> 1997-02-20 
<312> 1999-01-12 

<400> 22 

cugacaucau caccuagcag uucaugcagc uagcaagugg uuuguucuua ggguaacaga 60 
ggaggaaauu guuccucguc ugauaagaca acaguggaga aaggacgcau gcaguuucuu 120 
^gsgacacgg cugacuucca gauaugacca uguauuugug gcuuaaacuc uugg 174 



<210> 23 
<211> 25 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Peptide G25 

representing the putative lipid-binding region (G 
region) of the erythrocyte Ca2+ pump 

<400> 23 

Lys Lya Ala Val Lys Val Pro Lys Lys Glu Lys Ser Val hen Gin Gly 
15 10 15 

Lys Leu Thr Arg Leu Ala Val Gin lie 
20 25 



<210> 24 

<211> 39 

<212> DNA 

<213> Artificial 



Sequence 



<220> 



wo 01/62968 PCTAJSOl/00452 

14 

<223> Description of Artificial Sequence: Primer 
oligonucleotide 

<220> 

<223> The NNS triplet degeneracy is repeated depending 
on the desired length of the complementarity 
determining region (CDR) . 

<300> 

<310> 5,679,548 
<311> 1993-06-14 
<312> 1997-10-21 

<400> 24 

gtgtattatt gtgcgagann stggggccaa gggaccacg 39 



<210> 25 
<211> 5 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: synthetic 

peptide representing metal -binding protein surface 
domains from human histadine rich glycoprotieri 
(HRG) . 

<300> 

<3 03> J. Chroma togr, 
<304> 604 
<305> 1 
<306> 125-132 
<307> 1992 

<400> 25 

Gly His His Pro His 
1 5 



<210> 26 
<211> 16 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Mg (II) ion 
binding protein sequence 

<300> 

<310> 5,679,548 
<311> 1993-06-14 
<312> 1997-10-21 

<400> 26 

Ser Arg Arg Ser Arg His His Pro Arg Met Trp Asn Gly Leu Asp Val 
15 10 15 



<210> 27 
<211> 16 
<212> PRT 

<213> Artificial Seqiuence 



<220> 

<223> Description of Artificial Sequence: Mg (II) ion 
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binding protein sequence. 

<300> 
'<310> 5,679,548 
<311> 1993-06-14 
<312> 1997-10-21 

<400> 27 

Gly Arg Phe Lys Arg Val Arg Asp Arg Trp Val Val lie Phe Asp Phe 
15 10 15 



<210> 28 
<211> 16 
<212> PRT 

<213> Artificial Sequence 

<220> 

<223> Description of Artificial Sequence: Mg (II) ion 
binding protein sequence. 

<300> 

<310> 5,679,548 
<311> 1993-06-14 
<312> 1997-10-21 

<400> 28 

Gly Val Ala Arg Ser Lys Lys Met Arg Gly Leu Trp Arg Leu Asp Val 
1 5 . 10 15 



<210> 29 
<211> 16 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Mg (II) ion 
binding protein sequence. 

<300> 

<310> 5,679,548 
<311> 1993-06-14 
<312> 1997-10-21 

<400> 29 

Gly Leu Ala Val Arg Ser Lys Arg Gly Arg Phe Phe Leu Phe Asp Val 
15 10 15 



<210> 30 
<211> 10 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: CU (II) ion 
binding protein sequence. 

<300> 

<310> 5,679,548 
<311> 1993-06-14 
<312> 1997-10-21 



<400> 30 

Gly Arg Val His His His Ser Leu Asp Val 
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15 10 



<210> 31 
<211> 10 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequenc : CU (11) ion 
binding protein sequence. 

<300> 

<310> 5,679,548 
<311> 1993-06-14 
<312> 1997-10-21 

<400> 31 

Ser Trp Lys His His Ala His Trp Asp Val 
15 10 



<210> 32 
<211> 10 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial jSequence: Cu (II) ion 
binding protein sequence. 

<300> 

<310> 5,679,548 
<311> 1993-06-14 
<312> 1997-10-21 

<400> 32 

Gly Ser Trp Asp His Arg Gly Cys Asp Qly 
15 10 



<210> 33 
<211> 10 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Cu (II) ion 
binding protein sequence. 

<300> 

<310> 5,679,548 
<311> 1993-06-14 
<312> 1997-10-21 

<400> 33 

Gly His His Met Tyr Gly Qly Trp Asp His 
15 10 



<210> 34 
<211> 10 
<212> PRT 

<213> Artificial Sequence 



<220> 
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<223> Description of Artificial Sequence: Cu (II) ion 
binding protein sequence. 

<300> 

<310> 5,679,548 
<311> 1993-06-14 
<312> 1997-10-21 

<400> 34 

Gly His Trp Gly Arg His Ser Leu Asp Thr 
15 10 



<210> 35 
<211> 10 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Cu (II) ion 
binding protein sequence. 

<300> 

<310> 5,679,548 
<311> 1993-06-14 
<312> 1997-10-21 

<400> 35 

Gly His lie Leu His His Gin. Leu Asp Leu 
1 . 5 10 



<210> 36 
<211> 10 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Cu (II) ion 
binding protein sequence. 

<300> 

<310> 5,679,548 
<311> 1993-06-14 
<312> 1997-10-21 

<400> 36 

Ser Ser Gin Arg Leu Met Leu Gly Asp Asn 
15 10 



<210> 37 
<211> 10 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Cu (II) ion 
binding protein sequence. 

<300> 

<310> 5,679,548 
<311> 1993-06-14 
<312> 1997-10-21 



<400> 37 



n 
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Ser His His Gly His His Tyr Leu Asn His 
• 1 5 10 



<210> 38 
<211> 16 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Cu (II) ion 
binding protein secjuence. 

<300> 

<310> 5,679,548 
<311> 1993-06-14 
<312> 1997-10-21 

<400> 38 

Gly Lys Leu Met Met Ser Trp Cys Arg Asp Thr Glu Gly Cys Asp His 
1 5 10 15 



<210> 39 
<211> 16 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Cu (II) ion 
binding protein sequence. 

<300> 

<310> 5,679,548 
<311> 1993-06-14 
<312> 1997-10-21 

<400> 39 

Gly Asp Thr His Arg Gly His Leu Arg His His Leu Pro His Asp Trp 
1 - 5 10 15 



<210> 40 
<211> 16 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Cu (II) ion 
binding protein sequence. 

<300> 

<310> 5,679,548 
<311> 1993-06-14 
<312> 1997-10-21 

<400> 40 

Gly Trp Gly Leu Trp Met Lys Pro Phe Val Trp Arg Ala Trp Asp Met 
15 10 15 

<210> 41 
<211> 10 
<212> PRT 

<213> Artificial Sequence 



<220> 
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<223> Description of Artificial Sequence: Zn (II) ion 
binding protein sequence. 

<300> 

<310> 5,679,548 
<311> 1993-06-14 
<312> .1997-10-21 

<400> 41 

Gly Arg Val His His His Ser Leu Asp Val 
1-5 10 



<210> 42 
<211> 10 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: 2n (II) ion 
binding protein sequence. 

<300> 

<310> 5,679,548 
<311> 1993-06-14 
<312> 1997-10-21 

<400> 42 

Ser His Thr His Ala Leu Pro Leu Asp Phe 
15 10 



<210> 43 
<211> 10 . 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Zn (II) ion 
binding protein secjuence. 

<300> 

<310> 5,679,548 
<311> 1993-06-14 
<312> 1997-10-21 

<400> 43 

Gly Gin Ser Ser Gly Gly Asp Thr Asp Asp 
15 10 



<210> 44 
<211> 10 
<212> PRT 

<213> Artificial Sequence 

<220> 

<223> Description of Artificial Sequence: Zn (II) ion 
binding protein sequence. 

<300> 

<310> 5,679,548 
<311> 1993-06-14 
<312> 1997-10-21 
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<400> 44 

Gly Gin Trp Thr Pro Arg Gly Asp Asp Phe 
15 10 



<210> 45 
<211> 10 
<212> PRT 
•<213> Artificial Sequence 

<220> 

<223> Description of Artificial Sequence: Zn (II) ion 
binding protein sequence. 

<300> 

<310> 5,679,548 
<311> 1993-06-14 
<312> 1997-10-21 

<400> 45 

Gly Arg Cys Cys Pro Ser Ser Cys Asp Glu 
1 5 10 



<210> 46 
<211> 16 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Zn (II) ion 
binding protein sequence. 

<300> 

<310> 5,679,548 
<311> 1993-06-14 
<312> 1997-rl0-21 

<400> 46 

Gly Pro Ala Lys His Arg His Arg His Val Gly Gin Met His Asp Ser 
15 10 15 



<210> 47 
<211> 10 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Pb (III) ion 
binding protein sequence. 

<300> 

<310> 5,679,548 
<311> 1993-06-14 
<312> 1997-10-21 

<400> 47 

Gly Asn Leu Arg Arg Lys Thr Ser Asp lie 
15 10 



<210> 48 
<211> 10 
<212> PRT 
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<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Pb (III) ion 
binding protein sequence, 

<300> 

<310> 5,679,548 
<311> 1993-06-14 
<312> 1997-10-21 

<400> 48 

Gly Glu Ser Asp Ser Lys Arg Glu Asp Gly 
15 10 



<210> 49 
<211> 10 
<212> PRT 

<213> Artificial Sequence 

<220> 

<223> Description of Artificial Sequence: Pb (III) ion 
binding protein sequence. 

<300> 

<310> 5,679,548 
<311> 1993-06-14 
<312> 1997-10-21 

<400> 49 

Gly Gly Pro Ser Leu Ala Val Gly Asp Trp 
15 10 



<210> SO 
<;211> 10 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Pb (III) ion 
binding protein sequence. 

<300> 

<310> 5,679,548 
<311> 1993-06-14 
<312> 1997-10-21 

<400> 50 

Gly Pro LeU Gin His Thr Tyr Pro Asp Tyr 
15 10 



<210> 51 
<211> .16 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Pb (III) ion 
binding protein sequence. 

<300> 

<310> 5,679,548 
<311> 1993-06-14 
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<312> 1997-10-21 
<400> 51 

Gly Trp Lys Val Thr Ala Glu Asp Ser Thr Glu Gly Leu Phe Asp Leu 
15 10 15 



<210> 52 
<211> 16 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Pb (III) ion 
binding protein sequence. 

<300> 

<310> 5,679,548 
<311> 1993-06-14 
<312> 1997-10-21 

<400> 52 

Gly Thr Arg Val Trp Arg Val Cys Gin Trp Asn His Glu Glu Asp Gly 
15 10 15 



<210> 53 
<211> 16 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Pb (III) ion 
binding protein sequence. 

<300> 

<310> 5,679,548 
<311> 1993-06-14 
<312> 1997-10-21 

<400> 53 

Gly Glu Trp Trp Cys Ser Phe Ala Met Cys Pro Ala Arg Trp Asp Phe 
1 5 10 . . 15 ' 



<210> 54 
<211> 16 
<212> PRT 

<213> Artificial Sequence 

<220> 

<223> Description of Artificial Sequence: Pb (III) ion 
binding protein sequence. 

<300> 

<310> 5,679,548 
<311> 1993-06-14 
<312> 1997-10-21 

<400> 54 

Gly Asp Thr lie Phe Gly Val Thr Met Gly Tyr Tyr Ala Met Asp Val 
15 10 15 



<210> 55 
<211> 10 
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<212> PRT - 

<213> Artificial Secjuence 
<220> 

<223> Description of Artificial Sequence: Ce (III) ion 
binding protein sequence. 

<300> 

«:310> 5,679,548 
<311> 1993-06-14 
<312> 1997-10-21 

<400> 55 

Gly Gin Val Met Gin Glu Leu Gly Asp Ala 
15 10 



<210> 56 
<211> 10 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Ce (III) ion 
binding protein sequence. 

<300> 

<310> 5,679,548 
<311> 1993-06-14 
<312> 1997-10-21 

<400> 56 

Gly Leu Thr Glu Gin Gin Leu Gin Asp Gly 
1 5 • 10 



<210> 57 
<211> 10 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Ce (III) ion 
binding protein sequence. 

<300> 

<310> 5,679,548 
<311> 1993-06-14 
<312> 1997-10-21 

<400> 57 

Gly Tyr Ser Tyr Ser Val Ser Pro Asp Ala 
15 10 



<210> 58 
<211> 10 
<212> PRT 

<213> Artificial Sequence 

<220> 

<223> Description of Artificial Sequence: Ce (III) ion 
binding protein sequence. 

<300> 

<310> 5,679,548 
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<311> 1993-06-14 
<312> 1997-10-21 

<400> 58 

Gly Arg Leu Gly Leu Val Met Thr Asp Glu 
1 5 10 



<210> 59 
<211> 16 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Ce (III) ion 
binding protein sequence. 

<300> 

<310> 5;679,548 
<311> 1993-06-14 
<312> 1997-10-21 

<400> 59 

Ser Thr Trp Pro Gly Arg Gin Arg Leu Gly Gin Ala Leu Ser Asp Ser 
15 10 15 



<210> 60 
<211> 16 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Ce (III) ion 
binding protein sequence. 

<300> 

<310> 5,679,548 
<311> 1993-06-14 
<312> 1997-10-21 

<400> 60 

Gly Tyr Glu Leu Ser Trp Gly Val Asp Gin Gin Glu Trp Trp Asp lie 
15 10 15 



<210> 61 
<211> 16 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Ce (III) ion 
binding protein sequence. 

<300> 

<310> 5,679,548 
<311> 1993-06-14 
<312> 1997-10-21 

<400> 61 

Gly Pro Val Arg Gly Leu Asp Gin Ser Lys Gly Val Arg Tyr Asp Asn 
1 5 . 10 15 



<210> 62 
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<211> 16 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Ce (III) ion 
binding protein sequence. 

<300> 

<310> 5,679,548 
<311> 1993-06-14 
<312> 1997-10-21 

<400> 62 

Gly Leu Ser Gin His lie Val Ser Glu Thr Gin Ser Ser Gly Asp Leu 
15 10 15 



<210> 63 
<211> 16 
<212> PRT 

<213> Artificial Sequence 

<220> 

<223> Description of Artificial Sequence; Ce (III) ion 
binding protein sequence. 

<300> 

<310> 5;679,548 
<311> 1993-06-14 
<312> 1997-10-21 

<400> 63 

Gly Leu Glu Ser Leu Lys Val Leu Gly Val Gin Leu Gly Gly Asp Leu 
15 10 15 



<210> 64 
<211> 16 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Ce (III) ion 
binding protein sequence. 

<300> 

<310> 5,679,548 
<311> 1993-06-14 
<312> 1997-10-21 

<400> 64 

Gly Asn Met He Leu Gly Gly Pro Gly Cys Trp Ser Ser Ala Asp He 
1 5 . 10 15 



<210> 65 
<211> 16 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Ce (III) ion 
binding protein sequence. 



<300> 
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<310> 5,679,548 
<311> 1993-06-14 
<312> 1997-10-21 

<400> 65 

Gly Cys Trp Asn Val Gin Arg Leu Val Val Tyr His Pro Pro Asp Gly 
15 10 15 



<210> 66 
<211> 16 
<212> PRT 

<213> Artificial Sequence 
.<220> 

<223> Description of Artificial Sequence; Ce (III) ion 
binding protein sequence. 

<300> 

<310> 5,679,548 
<311> 1993-06-14 
<312> 1997-10-21 

<400> 66 

Gly Phe Glu Val Thr Cys Ser Trp Phe Gly His Trp Gly Arg Asp Ser 
15 10 15 



<210> 67 
<211> 16 
<212> PRT 

<213> Artificial Sequence 

<220> 

<223> Description of Artificial Sequence: Pe (III) ion 
binding protein sequence. 

<300> 

<310> 5,679,548 
<311> 1993-06-14 
<312> 1997-10-21 

<400> 67 

Ser Ala Ser Met Arg Ser Ala lie Gly Leu Trp Arg Thr Met Asp Tyr 
1 ' 5 10 15 



<210> 68 
<211> 16 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Fe (III) ion 
binding protein sequence. 

<300> 

<310> 5,679,548 
<311> 1993-06-14 
<312> 1997-10-21 

<400> 68 

Gly Asp Arg Glu He Phe His Met Gin Trp Pro Leu Arg Val Asp Val 
15 10 15 
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<210> 69 
<211> 16 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Fe (III) ion 
binding protein sequence. 

<300> 

<310> 5,679,548 
<311> 1993-06-14 
<312> 1997-10-21 

<400> 69 

Ser Gin Asn Pro Gin Gin Val Cys Gly Val Arg Cys Gly Gin Asp Lys 
15 10 15 



<210> 70 
<211> 16 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence; Fe (III) ion 
binding protein sequence. 

<300> 

<310> 5,679,548 
<311> 1993-06-14 
<312> 1997-10-21 

<400> 70 

Gly Asn Arg Leu Ser Ser Gly His Leu Leu Lys Gin Gly Gin Asp Gly 
15 10 15 



<210> 71 
<211> 16 
<212> PRT 

<213> Artificial Sequence 

<220> 

<223> Description of Artificial Sequence: Fe (III) ion 
binding protein secpience. 

<300> 

<310> 5,679,548 
<311> 1993-06-14 
<312> 1997-10-21 

<400> 71 

Gly Gly Ser Asp Trp Gin lie Gly Ala Cys Cys Arg Glu Asp Asp Leu 
15 10 15 



<210> 72 
<211> 16 
<212> PRT 

•<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Fe (III) ion 
binding protein sequence. 
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<300> 

<310> 5,679,548 
<311> 1993-06-14 
<312> 1997-10-21 

<400> 72 

Gly Met Val Ser Met Met Gly Gin Ser Arg Pro Thr Gin Cys Asp Cys 
15 10 15 



<210> 73 
<211> 16 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Fe (III) ion 
binding protein sequence. 

<300> 

<310> 5,679,548 
<311> 1993-06-14 . 
<312> 1997-10-21 

<400> 73 

Gly Val He hya Trp He Arg Arg Trp Val Arg Thr Ala Arg Asp Val 
1 5 • 10 15 



<210> 74 
<211> 16 
<212> PRT 

-<213> Artificial Sequence 
<220> . 

<223> Description of Artificial Sequence: Fe (HI) ion 
binding protein sequence. 

<300> 

<310> 5,679,548 
<311> 1993-06-14 
<312> 1997-10-21 

<400> 74, 

Gly Trp Phe Trp Arg Leu Leu Pro Thr Pro Arg Ala Pro Ser Asp Val 
15 10 15 



<210> 75 
<211> 909 
<212> DNA 

<213> Homo sapiens 

<220> 

<221> CDS 

<222> (74) . . (541) 

<220> 

<223> Human glutathione S- transferase cDNA 

<300> 

<308> J03746/GenBank 
<400> 75 

gaattcaagt cctaaagcct acagttttga atactactga aatgacaagt tattccagac 60 
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caaaattgaa aaa atg gtt gac etc acc cag gta atg gat gat gaa gta 109 
Met Val Asp Leu Thr Gin Val Met Asp Asp Glu Val 
15 10 

ttc atg get ttt gca tec tat gca aca att att ctt tea aaa atg atg 157 
Phe Met Ala Phe Ala Ser Tyr Ala Thr lie lie Leu Ser Lys Met Met 
15 20 25 

ctt atg agt act gca act gca ttc tat aga ttg aca aga aag gtt ttt 205 
Leu Met Ser Thr Ala Thr Ala Phe Tyr Arg Leu Thr Arg Lys Val Phe 
30 35- 40 

gee aat cca gaa gac tgt gta gca ttt ggc aaa gga gaa aat gcc aag 253 
Ala Asn Pro Glu Asp Cys Val Ala Phe Gly Lys Gly Glu Asn Ala Lys 
45 50 55 60 

aag tat ctt cga aca gat gac aga gta gaa cgt gta cgc aga gcc cac 301 
Lys Tyr Leu Arg Thr Asp Asp Arg Val Glu Arg Val Arg Arg Ala His 
65 70 75 

ctg aat gac ctt gaa aat att att cca ttt ctt gga att ggc etc ctg 349 
Leu Asn Asp Leu Glu Asn lie He Pro Phe Leu Gly He Gly Leu Leu 
80 85 90 

tat tec ttg agt ggt ecc gac cec tct aca gcc ate ctg cac ttc aga 397 
Tyr Ser Leu Ser Gly Pro Asp Pro Ser Thr Ala He Leu His Phe Arg 
95 100 105 

eta ttt gtc gga gca egg ate tac cac acc att gca tat ttg aca cec 445 
Leu Phe Val Gly Ala Arg He Tyr His Thr He Ala Tyr Leu Thr Pro 
110 115 120 

ctt cec cag cca aat aga get ttg agt ttt ttt gtt gga tat gga gtt 493 
Leu Pro Gin Pro Asn Arg Ala, Leu Ser Phe Phe Val Gly Tyr Gly Val 
125 130 135 140 

act ctt tec atg get tac agg ttg ctg aaa agt aaa ttg tac ctg taa 541 
Thr Leu Ser Met Ala Tyr Arg Leu Leu Lys Ser Lys Leu Tyr Leu 





145 




150 




155 




agaaaatcat 


acaactcaac 


atceagttgg 


etttttaaga 


attctgtact 


tceaatttat 


601 


aatgaatact 


ttcttagatt 


ttaggtagga 


sgggagcaga 


ggaattatga 


actggggtaa 


661 


acccattttg 


aatattagea 


ttgccaatat 


cctgtattct 


tgttttacat 


ttggattaga 


721 


aatttaacat 


agtaattctt 


aagtcttttg 


tctgattttt 


aaagtacttt 


cttataaatt 


781 


tggatcatgt 


tatgatttgt 


aacattcaca 


caacacctca 


cttttgaatc 


tataaaagaa 


841 


ttgcacgtat 


gagaaaccta 


tatttcaata 


ctgctgaaac 


agacatgaaa 


taaagaattt 


901 


aaagaatg 












909 



<210> 76 
<211> 155 
<212> PRT 

<213> Homo sapiens. 



<400> 76 

Met Val Asp Leu Thr Gin Val Met Asp Asp Glu Val Phe Met Ala Phe 
1 5 10 15 • 

Ala Ser Tyr Ala Thr He He Leu Ser Lys Met Met Leu Met Ser Thr 
20 25 30 
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Ala Thr Ala Phe Tyr Arg Leu Thr Arg Lys Val Phe Ala Asn Pro Glu 
35 40 45 

Asp Cys Val Ala Phe Gly Lys* Gly Glu Asn Ala Lys Lys Tyr Leu Arg 
50 - 55 60 

Thr Asp Asp Arg Val Glu Arg Val Arg Arg Ala His Leu Asn Asp Leu 
65 70 75 80 

Glu Asn lie lie Pro Phe Leu Gly lie Gly Leu Leu Tyr Ser Leu Ser 
85 90 95 

Gly Pro Asp Pro Ser Thr Ala lie Leu His Phe Arg Leu Phe Val Gly 
100 105 110 

Ala Arg lie Tyr His Thr He Ala Tyr Leu Thr Pro Leu Pro Gin Pro 
115 120 125 

Asn Arg Ala Leu Ser Phe Phe Val Gly Tyr Gly Val Thr Leu Ser Met 
130 135 140 

Ala Tyr Arg Leu Leu Lys Ser Lys Leu Tyr Leu 
145 150 155 



<210> 77 
<211> 4 
<212> PRT 

<213> Artificial Sequence 

<220> 

<223> Description of Artificial Sequence: Tetrapeptide 
prior to cleavage site of Factor Xa recognition 
sequence . 

<300> 

<3 03> Nature 
<304> 309 
<305> 810 
<307> 1984 

<400> 77 
He Glu Gly Arg 
1 



<210> 78 
<211> 8 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: PreScission 
Protease recognition sequence. 

<220> 

<223> PreScission Protease cleaves between Gin and Gly. 
<300> 

<303> J. Biol. Chem. 
<304> 265 
<306> 9062 
<307> 1990 



<400> 78 
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PCT/USOl/00452 



Leu Glu Val Leu Phe Gin Gly Pro 
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